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Preface 


This two-volume book was written for students of technical colleges 
who have had the usual mathematical training. It contains just enough in- 
formation to continue with a wide variety of engineering disciplines. It 
covers analytic geometry and linear algebra, differential and integral cal- 
culus for functions of one and more variables, vector analysis, numerical 
and functional series (including Fourier series), ordinary differential equa- 
tions, functions of a complex variable, Laplace and Fourier transforms, 
and equations of mathematical physics. This list itself demonstrates that 
the book covers the material for both a basic course in higher mathematics 
and several specialist sections that are important for applied problems. 
Hence, it may be used by a wide range of readers. Besides students in techni- 
cal colleges and those starting a mathematics course, it may be found useful 
by engineers and scientists who wish to refresh their knowledge of some 
aspects of mathematics. 

We tried to give the fundamental material concisely and without dis- 
tracting detail. We concentrated on the presentation of the basic ideas of 
linear algebra and analysis to make it detailed and as comprehensible as 
possible. Mastery of these ideas is a requirement to understand the later 
material. 

The many examples also serve this aim. The examples were written to 
help students with the mechanics of solving typical problems. 

More than 600 diagrams are simple illustrations, clear enough to 
demonstrate the ideas and statements convincingly, and can be fairly easily 
reproduced. 

We were conscious not to burden the course with scrupulous proofs 
for theorems which have little practical application. As a rule we chose 
the proof (marked in the text with special symbols) that was constructive 
in nature or explained fundamental ideas that had been introduced, show- 
ing how they work. This approach made it possible to devise algorithms 
for solving whole classes of important problems. 

In addition to the examples, we have included several carefully selected 
problems and exercises (around 1000) which should be of interest to those 
pursuing an independent mathematics course. The problems have the form 
of moderately sized theorems. They are very simple but are good training 
for those learning the fundamental ideas. 
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Chapters 1-6, 26 and Appendix II were written by E. Shikin, Chapters 
7-8, 11, 12, 17-21, 27, 28 and 29-32 by M. Krasnov, Chapters 9, 10, 13-16 
by A. Kiselev, and Chapters 22-25 and Appendix I by G. Makarenko. There 
was no general editor, but each of the authors has read the chapters written 
by the colleagues, and so each chapter benefited from collective advice. 


The Authors 


Chapter 1 


An Introduction to Analytic Geometry 


1.1 Cartesian Coordinates 


Coordinate axis. Let L be an arbitrary line. We may move along 
L in either of two directions. When the direction of moving is fixed, the 
line is said to be directed. 

Definition. A directed line is called an axis. 

The direction of an axis is indicated by an arrow (Fig. 1.1). 

We fix on the axis L a point O and a line segment a of a unit length, 
called a unit distance (Fig. 1.2). Let M be a point on L. We associate with 
M a number x such that the value of x is equal to the positive distance 
between O and M if the direction of moving from O to M coincides with 
the direction of L, and to the negative distance otherwise (Fig. 1.3). 








L 
Fig. 1.1 
ЕНЕС АЗЕР 
— eo і 
0 L 
Fig. 1.2 
r 
Lá. x 
0 M Lj 
Fig. 1.3 Fig. 1.4 


Definition. The axis L with the reference point O and the unit distance 
a given on it is called the coordinate axis; the number x as defined above 
is said to be the coordinate of M. 

In symbols, we write M(x) to designate a point M whose coordinate is x. 

Cartesian coordinates in a plane. Let O be a point in a plane. We draw 
through О two mutually perpendicular lines L1 and /2. Let us choose a 
direction for each line and a unit distance a which is the same for Lı and 
L2. Then Їл and L2 become coordinate axes with a common reference point 
O (Fig. 1.4). 
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We call one of the coordinate axes the x-axis or the axis of abscissas 
and the other one the y-axis or the axis of ordinates (Fig. 1.5). The point 
O is called the origin of coordinates. 

Let M be a point in a plane as shown in Fig. 1.6. We drop from M 
two perpendiculars onto the coordinate axes, the points М, and M, being 
the projections of M on the x- and y-axes and associate with M an ordered 
pair (x, y) of numbers so that x is the coordinate of the point M, and 
y is the coordinate of the point Mọ. 
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The numbers x and y are called the Cartesian coordinates of M, x being 
the abscissa and y being the ordinate. 

In symbols, we write M(x, y) to designate a point M whose coordinates 
are x and y. 

For short, we shall refer to the frame of reference given above as a Carte- 
sian coordinate system set up in a plane. 

The coordinate axes divide a plane into four parts called quadrants. 
These are numbered as shown in Fig. 1.7 and the accompanying table. 

Remark. The unit distances may be different for the two axes. Then 
the coordinate system is called rectangular. 
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Cartesian coordinates in three-dimensional space. Let O be a point in 
three-dimensional space. We draw through O three mutually perpendicular 
lines £1, L2 and Їл. We choose a direction for each line and a unit distance 
which is the same for Li, L2 апа Із. Then Lı, L2 and L; become coordinate 
axes with a common reference point O (Fig. 1.8). 

We call one of the axes the x-axis or the axis of abscissas, the second 
one the y-axis or the axis of ordinates, and the third one the z-axis or the 
axis of applicates. The point O is called the origin of coordinates (Fig. 1.9). 





Fig. 1.8 Fig. 1.9 





Fig. 1.10 


Let M be an arbitrary point in three-dimensional space as shown in 
Fig. 1.10. We drop from M three perpendiculars onto the coordinate axes, 
the points Му, Му and M; being the projections of M on the x-, y- and 
z-axes and associate with M an ordered triple (x, y, z) of numbers, so that 
x is the coordinate of the point М;, y is the coordinate of the point М 
and z is the coordinate of the point Mz. 
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The numbers x, y and z are said to be the Cartesian coordinates of 
M; x, y and z are called the abscissa, ordinate and applicate of the point 
M, respectively. 

In symbols, we write M(x, y, z) to designate a point M whose coor- 
dinates are x, y and z. 

Thus we have set up a Cartesian coordinate system in three-dimensional 
space. 

Definition. A plane determined by any pair of coordinate axes is called 
a coordinate plane. 

There are three coordinate planes in three-dimensional space, namely 
the xy-plane, the xz-plane and the yz-plane. These planes divide the space 
into eight parts called octants. 


12 Elementary Problems of Analytic Geometry 


Distance formulas. Let Mi(xi) and M202) be points on a coor- 
dinate axis. Then the distance d between Mi(x1) and М(х) is given by 


d= ам, M3) = lx. Fu: xi|. 


Let there be given a Cartesian coordinate system in an xy-plane. Then 
the distance between any two points М(х, yı) and М(х, у) is given by 


d = d(Mi, М) = Мо — iy + (о-и). 








KY 








ча: Consider the right triangle ММ,М» (Fig. 1.11). The theorem of 
Pythagoras gives |MiMo|? = |MiM|? + |ММЬГ. Since the distance between 
М, and М» equals the length of the segment М.М and |MiM| = |x2 — xıl, 
ММЬ| = |» — xij, we have 


Ф = о - xl? + | - )1 


Notice that x: — х1]? = (2 – x and |у — xi? = (уь — yı}. Then 
extracting the square root from 42, we get the desired formula. P 


K 
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Remark. In three-dimensional space the distance between Mi(xi, Эл, zi) 
and M202, J2, z2) is 


d = d(M,, M) = V% — xy + (2 — x, + (a — 21)’. 


(Show this.) 

Examples. (1) Write the equation of a circle with radius r and centre 
at the point P(a, b). 
ча Let M(x, y) be a point of a circle (Fig. 1.12). Then |MP| = r. Since 
|MP| is the distance between M and P we have 


IMP| = ғ = У(х – ay + (у - by. 
Squaring this equation, we get 
(x - a + (у - by =r. 
This is the desired equation of a circle. 9» 
(2) Let Fi( — c, 0) and F;(c, 0) be points in a plane and а (a > c 20) 
be a given number. Find the condition to be satisfied by coordinates 


(x, y) of a point M for the sum of the distances between M and F; and 
between M and Р, to be equal to 2a. 





Fig. 1.12 Fig. 1.13 


за Let us find the distances between M and F; and between M and F, 
(Fig. 1.13). We have 


МЕ] = Nx + c? +y? and |МЕ| = Nx - o? + X. 
Whence 
Gy «cosy = 24. 


Transpose the second radical to the right: 


М(х + cy + у? = 2а – М(х – ey + у>. 
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Then, squaring and simplifying both sides of the equation, we get 
а\ (х — с)? + у? = а? ~ сх. 


Squaring and further simplifying both sides of the above equation, we 
obtain 
(а? — с2)х? + а?у? = а?а? - с?). 
Setting b? = а? — с? and dividing both sides by a?b?, we arrive at the 
equation of an ellipse (see Chap. 4) 
2 2 
x У 
Бык ctu. 
a’ b 
This is the condition we have sought for. » 
Division of a line segment in a given ratio. Let Mi(xi, yi) and 
M»3(x», у) be two distinct points in a plane. Let a point M(x, y) lie on 
the line segment MiM» and divide М.М» in the ratio №:№, i.e., 


.|.: MiM] m M 


MM] M` 


Represent the coordinates (x, y) of M in terms of the coordinates of 
Mı and Mz and the numbers № and №. 








Fig. 1.14 


* Suppose that the segment is not parallel to the y-axis (Fig. 1.14). Then 


IMM| — |MuM;| 
|\ММ» | М.М, i 


Since (ММ = þa — x| and |М,Мь | = |x — x2| we have 





a- № 
|x — | № ` 
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The point M lies between М, and Mz. Hence, there holds either 
x < х< хо ог ху > x > x. This implies that the differences х — x and 
х — x; are always of the same sign. Thus we may write 
м-х X 


xX — X% № ` 
Hence 
x n + м (*) 
Е м + № : 


When the segment М,М, is parallel to the y-axis, we have x1 = x» = x. 
Notice that this result immediately follows from (ж) if we set x1 = x». 
Similar reasoning yields 


2 №у + Му» 
м + de 


Example. Find the coordinates of the centre of gravity M of the triangle 
with vertices at М(х, у), №00, у) and Мз(хз, уз) (Fig. 1.15). 


Ms 


Fig. 1.15 


ча Recall that in any triangle the centre of gravity and the point of inter- 
section of medians coincide so that M divides each median in the ratio 
2:1 reckoning from the corresponding vertex. Thus, the coordinates of M 
are 
1x3 + 2x’ © ly3 + 2y’ 
poe M XT 
where x’ and y’ are the coordinates of the point М’ of the median 44M’. 
Since M’ is the mid-point of MiM5, we have 


E 1м + Ix "M ln + 1» 
Tg ME qu 


Substituting these relations into the formulas for x and y, we arrive 
at the desired result 
хл t X + 23 л + у» Уй 


Eel and y= 3 


ГА 


» 


3 9505 
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Remark. Let M(x, y, z) divide a segment joining Mi(xi, yi, 21) and 
M202, y», 22) in the ratio №: №. Then 


ixi + № = Му + Му» 2 MZ + Az 





utk raa ct eT uik 


1.3 Polar Coordinates 


Consider an axis Z in a plane and a point O on L (Fig. 1.16). 

Let M be a point distinct from O as shown in Fig. 1.17. The number r 

is the distance between O and M and ¢ is the angle between the positive 

direction of L and OM measured counterclockwise. We easily see that the 

position of M in the plane is uniquely defined by the values of r and с. 
M 





Fig. 1.16 Q Fig. 1.17 


The ordered pair (r, к) of numbers represents the polar coordinates 
of M. The numbers r and e are called the polar radius and the polar angle, 
respectively. E 

The point О is referred to as the pole and the axis L as the polar axis. 

It is clear that г> 0 and 0 < o < 2r. 

When M coincides with the pole, r = 0. In this case the polar angle 
is not defined. 

Therefore we may set another coordinate system in a plane, namely the 
polar coordinate system. 

The Cartesian coordinate system is said to be compatible with a given 
polar system if the origin O is the pole, the x-axis is the polar axis and 


the y-axis makes an angle of +5 with the x-axis (Fig. 1.18). Then the 


relations between the Cartesian coordinates and the polar coordinates are 


given by the formulas: 
cos y = Ý and sing =~, 
x-2rcose and y=rsing, 


where r= Vx? +y. 
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Fig. 1.18 Fig. 1.19 


Example. Let R (R > 0) be a given number. The set of points whose 
polar coordinates (r, o) satisfy the condition 
r=R 
is a circle with radius R and centre at the pole (Fig. 1.19). 


14 Second- and Third-Order Determinants 


We are given four numbers 011, di2, 021, a2 (these are read a 
one-one, @ one-two, @ two-one, @ two-two). 
The second-order determinant is the number 


411422 — 412021. 
In symbols, we write 


ајр 412 


A= = 1102 — 012021 = det |а| (/ = 1, 2; j = 1, 2) 


(1.1) 








Gi 022 


to designate the second-order determinant. 

The numbers алт, 812, 2 and az are called elements of A. The pairs 
а, d12 and d21, 222 of elements are referred to as the first and the second 
rows of A, respectively; аш, @1 and ал, a2? as the first and the second 
columns of A, respectively; аш, @22 as the principal (or positive) diagonal 
of A; and aj, a1 as the secondary (or negative) diagonal of A (see 
Fig. 1.20). 


Fig. 1.20 
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Therefore the value of the second-order determinant is equal to the 
product of elements on the principal diagonal minus the product of ele- 
ments on the secondary diagonal. 

Example. Compute the value of 








-1 3| 
-2 4| 

ч Using (1.1), we get 
ЕТЕ te 
-2 4 








We encounter the second-order determinants when dealing with systems 
of linear equations in two unknowns 




















s + apy = bı 22 
ах + any = b |—а\; 
А 411 012 e А Ё 
Provided that A — # 0, eliminating the unknowns gives 
421 4922 
_ bian - ba andes ba — Бїйгл 
411422 — 012021 411422 - 412021 
or, as quotients of determinants, 
b ao ar b 
b n Qi b 
х= + and y = ——————.. 
ап 412 апл d 
421 422 21 822 














Let there be given nine numbers а; (i= 1, 2, 3; j = 1, 2, 3). 
The third-order determinant 


4811 412 013 
А = 14: 42 Фз| = det lajl 
аз 032 033 


is the number whose value is computed as 


A = di1d22033 + 012023031 + 021032013 
— 41302243) — 021012033 — 011023432. (1.2) 
The first subscript i of aj; refers to the row, the second subscript j refers 
to the column. 
The triples of elements a11, 222, 433 and аз, 222, 231 are called the prin- 
cipal (or positive) diagonal and the secondary (or negative) diagonal of 
A, respectively. 
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To tackle a question of signs in (1.2) we mention that the three positive 
terms @11, d22, d33; d12, 023, d31 and G21, d32, аз correspond to the principal 
diagonal and to the vertices of the two triangles whose bases are parallel 
to the principal diagonal. Similarly, the secondary diagonal and the vertices 
of the two triangles whose bases are parallel to the secondary diagonal cor- 
respond to the three negative terms in (1.2), as shown in Fig. 1.21 and 
Fig. 1.22. 


Fig. 1.21 Fig. 1.22 


This consideration lays down a convenient procedure for computing the 
value of the third-order determinant, called the rule of a triangle. 
Example. Compute the value of 


1 0 -i 
А = |2 4 3 
3 -1 6 


ча The rule of a triangle gives 
1-1х4х6-(-1х2х(-140х3х3-(-1х3х4 
-0хХх2х6-1х(-1х3-24-240-4112-043-4» 


Now we shall turn our attention to some properties of the second- and 
the third-order determinants easily verified by applying formulas (1.1) and 
(1.2). 

Properties of determinants. (1) The value of the determinant is un- 
changed if the rows and the columns are transposed, 16., 














811 412 811 021 

— РД 

@1 022 8412 022 
11 Gdi2 аз @1 G1 а 
Ф] 022 43) = (14:2 022 032|. 
a31 032 033 із 023 033 


(2) The determinant reverses its sign if any two rows (or any two 
columns) are interchanged. 
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(3) The factor common to all elements of some row (or some column) 
may be taken outside the determinant, i.e., 


kay, kan Каз dà Ка а\з 411 412 а\3 
821 222 3| = |a: kan 43|-4К(|фл a2 оз 
831 d» 03 азі Каз? 03 азі 032 033 


The following three properties are consequences of Properties 1-3. We 
can also verify these properties by directly applying formulas (1.1) and (1.2). 

(4) The determinant is zero if it contains two identical rows (or two 
identical columns). 

(5) The determinant vanishes if all elements of some row (or some 
column) are zeros. 

(6) The determinant is zero if it contains two proportional rows (or two 
proportional columns). 

We shall outline another approach to computation of the third-order 
determinant 


а 4812 аз 
A= |@1 @2 5|. 
аз (032 033 
The minor Mi; of A is the second-order determinant obtained by delet- 
ing from A the row and the column which intersect in aij. 
For example, the minor M23 of A is the second-order determinant 
@1 412 
431 032 


Мз = 








The cofactor Aj; of the element aj in A is the minor M; if the sum 
(i + j) of the numbers of the ith row and the jth column is even, and the 
minor Mi; multiplied by (—1) otherwise, i.e., 
Ay = (C7 1! * My, 
where (— 1)'* is the position sign. : 
Theorem 1.1. The third-order determinant A is equal to the sum of 


products of elements in some row (or some column) of A multiplied by 
their respective cofactors so that the following expansions 


A = аА + anA + азАа (i = 1, 2, 3), (1.3) 
A = аА + 02142) + d3jAs; ( = 1, 2, 3) (1.4) 


hold. 
By way of illustration we show that 


А = tiA + 012412 + 03413. 
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ч Applying formula (1.3), we obtain 
A = а11(022033 — @32423) + 12(423431 — 421433) 4:413(4210432 — 031022) 


G1 023 Фі 0» 


аз d33 


422 (923 
ai 




















432 033 Q31 032 


= @ Mii — 0:2М12-4 813 М13 = aA + 2А + ann. В 


Formulas (1.3) and (1.4) represent the expansion of the determinant with 
respect to the ith row and the expansion of the determinant with respect 
to the jth column, respectively. 

Example. Compute the value of 


1 0 -1 
A-|2 4 3]. 
3 -1 6 
ч. Expanding the determinant with respect to the first row, we get 
€ 43| 0 25731: 1 2 4 
-1 6 3 6 3 -1 











= (24 + 3) +0 + (2 + 12) = 41. » 


Chapter 2 


Elements of Vector Algebra 


2. Fixed Vectors and Free Vectors 


Let A and B be two distinct points. We may move along a line 
segment joining A and B in either of two directions. Suppose we start mov- 
ing at A and end at B. This gives rise to a directed segment AB. When 
moving from B to A, we get a directed segment BA. 

Directed segments are frequently called fixed vectors. 

The direction of a fixed vector (or a directed segment) is indicated pic- 
torially by an arrow (Fig. 2.1). 

We denote a fixed vector by a symbol, say AB, with the understanding 
that the first letter in a pair refers to the initial point of the fixed vector 
and the second letter to the terminal point. 

A fixed vector is said to be zero if its initial and terminal points coincide. 


D 
і 
l 
| 
I 
ыг? 
| А. TB 
— B — B | E 
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Definition. The fixed vectors AB and CD are said to be equivalent if 
the mid-points of the segments AD and BC coincide (Fig. 2.2). 


In symbols, we write AB = cB to signify the fact that AB and CD 
are two equivalent fixed vectors. 

Notice that when AB and CD do not lie in the same line, we may in- 
troduce a definition equivalent to the given above, namely AB and CD are’ 


2.1 Fixed Vectors and Free Vectors | | 25 





equivalent if the quadrilateral ABCD is a parallelogram. Hence all equiva- 
lent fixed vectors are of the same length. 

By way of illustration we consider the square and the fixed vectors 
shown in Fig. 2.3. It is clear that AB and DÓ are equivalent while BC 
and DA are not. 

Equivalent fixed vectors obey the following laws. ! 

0) Any fixed vector is equivalent to itself, i.e., AB = AB. 

(2) If AB = CD then CD = AB. 


(3) If AB = CD and CD = EF then AB = EF. 


D C 


Fig. 2.4 





Fig. 2.5 


Let AB be a given fixed vector and C be an arbitrary point. By virtue 
of the above definition we can always find a point D (Fig. 2.4) such that 


CD = AB. 


Hence, to any point we may apply a fixed vector equivalent to the given one. 

Now we shall turn our attention to free vectors, ie, to vectors which 
may be made to start at an arbitrary point. In other words, a free vector 
is a vector which may be moved rigidly parallel to itself. 

We clearly see that a given fixed vector AB uniquely defines a free vector 
which starts at A and ends at B. 

Free vectors whose initial points lie on the line determined by a given 
(nonzero) fixed vector are thought of as sliding vectors (Fig. 2.5). 

Fixed vectors and sliding vectors are widely used in theoretical 
mechanics. 
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We shall denote free vectors by bold face small letters a, b, c, ... and 
the zero vector by 0. 
Let a be a free vector and A be a point. Then there exists only one 
point B such that 


AB = а. 


Hence, the operation of moving a given free vector to a specific point gives 


rise to a unique fixed vector AB. This operation is referred to as applying 
a free vector a to a point A. 


A 
Fig. 2.6 


Notice that applying a given free vector to different points, we obtain 
fixed vectors which are equivalent and, consequently, of the same length. 
Therefore we may speak of the length of a free vector, setting it equal to 
the length of the corresponding fixed vector. The length of the zero vector 
is equal to zero. We shall denote the length of a free vector a by jaj. 

Notice that if a = b then |а| = |b| and the converse is not true. 


2.2 Linear Operations on Vectors 


Addition of vectors. Let there be given two vectors a and b. Apply 
— 
the vector a to some point O so that OA = a and then apply the vector 


b to the point A, as shown in Fig. 2.7. The resultant vector OB is the sum 
of a and b denoted by a + b. This rule of addition of two vectors is known 
as the triangle law. 


Fig. 2.7 Fig. 2.8 
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It is easy to see that addition of two vectors obeys the commutative 
law, ie, there holds 


a+b=b+a. 
for any two vectors a and b (Fig. 2.8). 
Apply a and b to some point O and construct a parallelogram with 


sides a and b, as shown in Fig. 2.9. Then the vector OB which starts at 
О and ends at the vertex opposite іо О represents the sum a + b (orb + a). 
This rule of addition is called the parallelogram law. 





Fig. 2.9 Fig. 2.10 


Now we consider three vectors a, b and c and proceed as follows. First, 
we apply a to some point O so that OÀ - a. Then we apply b to A so 
that AB — b. And finally, we apply c to B so that BC = c. Thus we obtain 
a polygonal line of three vectors as shown in Fig. 2.10. 

Using the triangle law, we obtain OB — a 4 b and OČ = (a 4 b) +с 
(Fig. 2.11). 

On the other hand, we have AC —- b c and 00-а-ч (b + c) 
(Fig. 2.12). Hence, we conclude that for any three vectors a, b, c there holds 


(a+b)+c=a+ ( + с). 


Thus addition of three vectors obeys the associative law and we may 
write the sum of a, b and c as 


а+е +0. 





(а+Ы) +С 





Fig. 2.11 Fig. 2.12 
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Fig. 2.13 


Similarly, we define the sum of any number of vectors as a vector which 
starts at the initial point of the first summand and ends at the terminal 
point of the last summand as depicted in Fig. 2.13 for seven vectors. The 
sum of seven vectors is 


а= a + а; + аз + 4 + а; + 36 + 87. 


This rule of addition may be referred to as the /aw of closure of a poly- 


gonal line to complete a polygon. 
Example. Find the sum of six vectors each of which starts at the centre 


and ends at a vertex of a regular hexagon (Fig. 2.14). 





Fig. 2.14 Fig. 2.15 


« The law of closure of a polygonal line yields 
81-32-25 +a +a + 25 = 0. > 


Multiplication of a vector by a scalar. Definition. Free vectors а and 
b are said to be collinear if their corresponding fixed vectors lie on parallel 
lines (Fig. 2.15) or on the same line. 5 
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In symbols, we write all b to denote collinear vectors. 

Remark. It follows from the definition that a and b are always collinear 
when either a or b is a zero vector. 2: 

Let a and b be two collinear vectors applied to a point О so that ОА = a 
and OB = b. Then the points O, A and B lie on the same line. The point 
O may lie either outside the segment joining A and B or between A and 
B as shown in Fig. 2.16. The vectors a and b are of the same direction 
in the first case and of the opposite directions in the second one. 

The vectors a and b are said to be equivalent if they are of the same 
length and of the same direction. 

Let a be a vector and à be a scalar. 

Definition. The product of a vector a by a scalar ^ is the vector b such 
that 

(a) |b] = |А: lal; 

(b) a and b are of the same direction if \ > 0 and of opposite directions 
when à < 0. 

In symbols, we write b = ^a. 

When ^ = 0 we get ^a = 0. 


a 
oe e aa au a? 
О А В А 0 B 
Fig. 2.16 Fig. 2.17 


Therefore the vectors a and b = )a are collinear by definition. The con- 
verse is also true, namely if a (а ~ 0) and b are collinear vectors, then 
there exists a scalar А such that b = `a. 

Basic properties of multiplication of a vector by a scalar are 

(1) ( + рја = da + ра. 

(2) Миа) = Qua. 

(3) Жа + b) = ^a + №. 

(Here X and p are any real numbers, and a and b are any vectors.) 

Definition. A vector a? of a unit length (|a?| = 1) is called a unit vector. 

If a # 0 then the vector 


0 а 


1 
lal al 


is a unit уесіст of the same direction as а (Fig. 2.17). 
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2.3 Coordinates and Components of a Vector 


Let a Cartesian coordinate system with the origin O be given in 
three-dimensional space. 

Given a point M in the space, we may find three points P, Q and R 
called projections of M onto the coordinate axes (Fig. 2.18). Conversely, 
given three points P, Q and R on the coordinate axes, there exists a unique 
point M such that P, Q and R are projections of M onto the coordinate 
axes. Hence, the position of M in the space is uniquely defined by projec- 
tions P, Q and R of M onto the coordinate axes and vice versa. 

Recall that positions of P, Q and R on the coordinate axes are identified 
by their respective coordinates x, y and z. There is a one-to-one correspond- 
ence between M and the ordered triple (x, y, z) of numbers called the Carte- 
sian coordinates of M. 





Fig. 2.18 Fig. 2.19 


Let i, |, k be three unit vectors located at the origin О of a given Cartesi- 
an coordinate system and pointing in the positive directions of the x-, y- 
and z-axes, respectively, as shown in Fig. 2.19. We consider a vector a which 
starts at the origin O of coordinates and ends at the point A. Let us draw 
through A planes perpendicular to coordinate axes. These planes intersect 
the x-, y- and z-axes at the points P, Q and R, respectively. It follows from 
Fig. 2.20 that 


a = OP + OO + OR. (2.1) 


The vectors OP, ӧд апа ОЁ аге collinear to the unit vectors i, j, k, 


respectively. Hence, there exist numbers x, y and z such that OP = xi, 
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00 = yj and OR = zk, and 
а= хі + yj + zk. (2.2) 


Formula (2.2) is referred to as the expansion of a with respect to the unit 
vectors i, j, k. 

Expansion (2.2) is applicable to any vector. Since i, j, k are of a unit 
length and any two of them are perpendicular to each other, we call the 
ordered triple (i, j, k) of unit vectors the orthonormal basis. It is easily 
seen that for any a the expansion is unique. The numbers x, y and z in 
(2.2) are called the coordinates of a. These are the same as the coordinates 
x, у and z of the terminal point A of a. So we write a = (x, y, Zz}. 





Fig. 2.20 Fig. 2.21 


Vectors xi, yj and zk in (2.2) are called the components of a. 
Let a= (xi, y1, zı} and b = (x, y», z2] be two vectors. Then a and 
b are equivalent if and only if their respective coordinates are equal, i.e., 


х = №, 
a=b iff {y =)», 
214 = 22. 


The position vector of a point M(x, y, z) is the vector r = xi + yj + zk 
which starts at the origin O of coordinates and ends at M(x, y, z) 
(Fig. 2.21). 

Linear operations on vectors specified by their coordinates. Let there 
be given two vectors а = {m, у, zi]. and b= (x, ул, 22| so that 
а= xi + yj + zik and b = xi + yj + zk. Then, by laws of addition we 
get 

a + b= (xni + yj + zik) + Goi + »j + wk) 


= (и + хэл + Ол + Xj + @ + 2)k 


32 ,  2.Elementsof Vector Algebra. — 000 0 0 00 1 Lc 


Or 
а+ = {хл + х,у + у), Z +}. 
Hence, addition of vectors becomes addition of their corresponding coor- 
dinates. 
Similarly, we have 
a-b-í(un-»0h-X»2-2)]. 
Also 


ha = №і + vd + dak 
or 
Ха = (Xn, Mi, Ха. 
Hence, multiplication of a vector by a scalar reduces to multiplication of 


all the coordinates of the vector by the scalar. 
Let a = (x1, у, zı} and b = (x2, y», 22| be collinear vectors, and b >“ 0. 


Then a = pb, i.e., 
X17 ро, Yı = ру, 21 = Z2 








Fig. 2.22 


ог 
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Conversely, if (2.3) is fulfilled, then a = yb, i.e., a and b are collinear. 
In other words, the vectors a and b are collinear if and only if their 
coordinates are proportional. 
Example. Find the coordinates of the vector MM, where 
М, Ол, Yı, Z1) is the initial point and M2(x», y», 22) is the terminal point 
of the vector (Fig. 2.22). 
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*« From Fig. 2.22 it follows that M; M, = г — rı, where г and г; are 
position vectors of M, and M5, respectively. Thus 


MM -(-Х,У-У»,2-11. 


Hence, we conclude that the coordinates of ММ, are equal to the differ- 
ences of the respective coordinates of М and Mi. > 


2.4 Projection of a Vector onto an Axis 
Let I be an axis and AB be a nonzero directed segment of / (Fig. 2.23). 
The magnitude of the directed segment AB of the axis / is the num- 
ber equal to the positive length of AB if the directions of AB and 
1 coincide, and to the negative length of AB otherwise. 





Fig. 2.23 Fig. 2.24 


Let AB be an arbitrary vector shown in Fig. 2.24. We drop from A and 
B perpendiculars onto the axis /. Thus we get the directed segment CD of I. 

Definition. The projection of the vector AB onto the axis l is the magni- 
tude of the directed segment CD as given above. 





Fig. 2.25 
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Basic properties of projections. (1) The projection of a vector AB onto 
an axis / is equal to the product of the length of AB and the cosine of 
the angle between / and AB (Fig. 2.25), ie., 


. TE — 
proj, AB = |AB| cos a. 


a | 
a+b+e | | 
rd | | 
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Fig. 2.26 


(2) The projection of the sum of vectors onto an axis / is equal to the 
sum of the projections of the vectors onto the axis. 
For example, from Fig. 2.26 if follows that 


proj(a + b + с) = projra + proj;b + proj: c. 


2.5 Scalar Product of Two Vectors 


Let there be given two nonzero vectors a and b. 
Definition. A scalar product of two vectors a and b is a number а, b such 
that 


a-b = |а| |b] cos = |а| |b] cos (a, b), (2.4) 
where ọ = (a, b) is the angle between a and b. 


Observe that |Ы cos y is the projection of b onto the axis determined 
by a. Then we may write (see Fig. 2.27) 1 


a'b = |а| proje b. (2.5) 
Similarly, we have 
a-b = |b| projp a. 


Hence, a scalar product of two vectors is a number equal to the product 
of the length of one vector by the projection of the other vector onto the 
axis determined by the first vector. 

When either a or b is a zero vector, we have a-b = 0. 
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Properties of the scalar product. (1) The scalar product of a and b van- 
ishes if and only if a and/or b are zero vectors, or if a and b are perpendicu- 
lar (written a 1 b). 

-« This immediately follows from (2.4). > 

Since the direction of the zero vector is not defined we may set it perpen- 

dicular to any vector. So we may write the property in question as 


alb iff a-b=0. 





(a) 





Fig. 2.27 


(2) The scalar product obeys the commutative law, i.e., 
a-b = b-a. 


“4 This immediately follows from (2.4), taking into account that cos e 
is an even function. > 

(3) The scalar product obeys the distributive law relative to the sum 
of vectors, i.e., 


(a + b)-c = a-c + Dee. 
“4 Indeed, 
(а + Б):с = |c|-proje (a + b) = |e|-(projea + proje b) 
= |e|-projea + |c|-proj-b = arc + ес, » 


(4) If a or b is multiplied by a scalar ^, the scalar product of a and 
b is multiplied by А, i.e., 


(a): b = a-(Ab) = A(a-b). 
<4 Let л > 0. Then 
d(a-b) = Хај |b] cos (a, b) 
and 
(ха): = [A [al |b| cos Qa, b) = №а| |b] cos (a, b), 


3* 


36 2. Elements of Vector Algebra 


since the angles (a, b) and (ха, b) are equal (Fig. 2.28). 
We proceed similarly when ^ « 0. 
The case of А = 0 is trivial. > 
Remark. In general 


(a-b)-c # a- (b-c). 


Fig. 2.28 


Scalar product of two vectors specified by their coordinates. Let 
a= (хм, yı, 2) and = {x2, у», z2] be two vectors specified by their coor- 
dinates with respect to the orthonormal basis (i, j, k). 

We consider the scalar product of a and b, i.e., 


a'b = (xi + yj + zik) (6i + »j + zk). 
By the distributive law, we have 

a'b = xix (i-i) + yeli) + (К) + xi»-j) 

+ ($$) + zalk) + xz) + na k) + zo(kk). 

Observe that 

i-j = i-k = j-k = 0, 

iti = j-j = К.К = 1. 
Whence we obtain 

arb = xix; + узу» + 2122. І (2.6) 


We conclude that, given vectors а and b specified by their coordinates 
with respect to the orthonormal basis (i, j, k), the scalar product of a and 
b is equal to the sum of the products of their respective coordinates. 

Example. Compute the scalar product of 


a=4i-2j+k and b= бі + 3j + 2k. 
4 ab=4x6+(-2)x34+1xX2=20. > 


The scalar square of the vector a is a-a = a’. 


2.5 Scalar Product of Two Vectors 37 


Using (2.6) and setting b — a, we obtain 
а? = хү + yi + @. (2.7) 
On the other hand, 
а? = а-а = |а|?-со5 0 = Ja? 
so that (2.7) gives 


|а| = Мх + yp +z. (2.8) 


Hence, the length of a vector specified by its coordinates with respect to 
the orthonormal basis is equal to the square root of the sum of squares 
of the coordinates. 

Direction cosines. By definition, the scalar product of two vectors a 
and b is 


a-b = [а| |b| cos e, 


where is the angle between a and b. 
Provided that both a and b are nonzero vectors, the angle between a 
and b is given by 





8: 
cose = НЫ" (2.9) 
For a = (x1, у, zi] and = (x, yo, 2} (2.9) becomes 
COS ф = хэнээ ae (2.10) 


Nxt t+ yi + 41 -Nx2 + у + 552 


Example. Find the angle between а = (2, —4, 4} and b = ( —3, 2, 6). 


ча Using (2.10), we get 


-6-8424 _ 
cos ф = = 


5 
Ма +16 + 16 М9 +4+36 21. 


The value of o is easily obtained from tables of trigonometric func- 
tions. » 
Let b = i, that is, b = (1, 0, 0}. Then for any a = (xi, y1, zi) z 0 and 
the unit vector i we have 
ач 
cosa = —— 
al 





or 
xi 


== Ше. 
NXL + Yi + Zi 


where о is the angle between а and the x-axis. 


cos а = (2.11) 
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Similarly, 
у aj 
cos 8 = — ——s 77 ial , 
Nxityitz 
21 a-k 
cos y = = 





Jb ry tz lal 


(2.12) 


(2.13) 


Formulas (2.11)-(2.13) define the direction cosines of the vector a, i.e., 
the cosines of angles between a and the coordinate axes (Fig. 2.29). 





Fig. 2.29 


Example. Find the coordinates of the unit vector n°. 
< Let n° = xi + yj + zk. Since [n?| = 1, we get 


0 


n?-j = у = cos, 


n°-k = z = cos y. 


n^.i = x = [n?| [i| cos (n^, i) = cosa, 





Thus, the coordinates of the unit vector are direction cosines of this 
vector, i.e., cosines of the angles between n° and the coordinate axes, so that 


n? = icosa + jcos 8 + kcos y. 
Whence 


(n°)? = n°-n° = 1 = cos?a + cos? 8 + cos?y. в» 


Example. Let n^ be orthogonal to the z-axis, i.e., 


n? = xi + yj. 


2.6 Vector Product of Two Vectors : 39 








Then the coordinates x and y of n? are 
х= соѕф and у = sine, 
so that (Fig. 2.30) 


n? = icos e + jsing. 


2.6 Vector Product of Two Vectors 


Definition. The vector product of two noncollinear vectors a and 
b is a vector a x b such that 
(1) the length of a x b is equal to |а| |b] sin p, where х is the angle be- 
tween a and b (Fig. 2.31); 





Fig. 2.31 


(2) a X b is perpendicular to both a and b, i.e., a X b is perpendicular 
to the plane determined by a and b; 

(3) a x b is so directed that the shortest rotation from a to b is made 
counterclockwise as seen from the terminal point of the vector a x b 
(Fig. 2.32). In other words, the ordered triple of vectors (a, b, a х b) is 
right-handed, i.e., the thumb, the index finger and the middle finger of the 
right hand point along the directions of a, b and ах b, respectively. 


Fig. 2.32 
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When a and b are collinear, a x b = 0. 
The length of the vector product of a and b denoted by 


ах» = |а| |b| sin e (2.14) 


is numerically equal to the area So of the parallelogram with sides a and. 
b (Fig. 2.33), ie, 


la x b| = Sp. 


Fig. 2.33 


Properties of the vector product. (1) The vector product of a and b 
is equal to the zero vector if and only if a and/or b are zero vectors, or 
if a and b are collinear. (Recall that if a and b are collinear then the angle 
between a and b is either zero or т.) 
ча This immediately follows from |a x b| = |а| |b| sine. » 

We may set the zero vector collinear to any vector and express the 
property as 


alb iff axb=0. 
(2) The vector product obeys the anticommutative law, i.e., 
bxa- -(axb) (2.15) 


ча Indeed, a X b and b х a are collinear and of the same length. Since 
the shortest rotation from a to b is counterclockwise for a х b and is clock- 
wise for b X a, we conclude that a X b and b х a are of opposite directions 
(Fig. 2.34) ». 

(3) The vector product obeys the distributive law relative to the sum 
of vectors, i.e., 


(а + ъ) хс=ахс + х с. 


(4) If a or b is multiplied by а scalar ^ the vector product of a and 
b is multiplied by ^, i.e., 


(sa) x b = a x (Ab) = À(a X b). 
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Remark. The vector product is not associative. Indeed, the identity 
(a x b) x c = a x (b с) іѕ not generally true. For example, 


üxj*xj--i but ix(óxj-0. 
Vector product of two vectors specified by their coordinates. Let 


а = (x1, у, 1} and = (x, у», 22] be two vectors specified by their coor- 
dinates with respect to the orthonormal basis (i, j, k). 





Fig. 2.34 


Fig. 2.35 


By the distributive law, we obtain 


a X b= (xii + у} + zi) x (оі + yj + zk) 
= Xo (i X i) + x1» * D + mali КК) + yim x i) 
+ yV х j) + Nz х k) + Zk x i) + z»(k x j) 
+ 21Z2(k х k). (2.16) 
The vector products of basis vectors (see Fig. 2.35) are 
ixi=jxj=kxk=0, 
ixj=k, jxk=i, kxi=j, 
j*i--k kxj--i ixk= -j. 
Substituting these relations into (2.16), we obtain 
ахь = —xyk + 21] + x ek — арі + yzi — maj 
= 1012 — уд) + 10021- xim) + Елу» — xy). — (247) 
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To memorize (2.17) it is helpful to rewrite it in the from of the third- 

order determinant as 

i j К 

xi WM 2 

xX jz 2 


axb= : (2.18) 








Expanding (2.18) with respect to the first row, we obtain (2.17). 
Examples. (1) Compute the area of a parallelogram with sides 
a=i+j-—k and b = 2i + j — К. 
48 The area we are seeking for is So = |a x b|. By (2.18), we get 
i j k 
axb-|] 1 -11-4140-11-К-4-1--1-4К. 
2 1-1 








Whence 
So = v1 + = №. » 


(2) Compute the area of the triangle OAB shown in Fig. 2.36. 





Fig. 2.36 


ча We clearly see that the area SA of ОАВ is half the area Sp of the 
parallelogram OACB. Computing the vector product of а = OA and 


b= OB, we obtain 








i j k 
axb-|x JA 0| = ба» = Хэу1)ЇЕ. 
х » 0 
Whence 


So = ba» -xy| and Sa = 3 bay» — xil > 
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Scalar triple product. Let there be given three vectors a, b and c. 
We find the vector product a X b of a and b, and then the scalar product 
of a X b and c, ie, the number (a X b)-c. 
The number 


(a x b)-c 


is called the scalar triple product of three vectors a, b and c. 


Geometric interpretation of the scalar triple product. Let three vectors 
a, b and c be applied to some point O as shown in Fig. 2.37. If the points 
O, A, B and C lie in the same plane, ie, if the vectors a, b and c are 
coplanar, their scalar triple product (a X b):c is equal to zero, i.e, 





Fig. 2.37 


(а x b)-c = 0, since the vector a X b is perpendicular to the plane contain- 
ing the vectors a and b and hence c. In other words, the vectors a X b 
and c are perpendicular to each other and their scalar product is equal 
to zero. 

When the points O, A, B, C do not lie in the same plane, ie, when 
a, b and c are noncoplanar, we construct a parallelepiped with edges OA, 
OB and OC (Fig. 2.38). 

By definition 


a x b= Se, 









> 


(а) 





la 





Fig. 2.38 
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where $ is the area of the parallelogram OADB, e is the unit vector perpen- 
dicular to both a and b so that the ordered triple of vectors (a, b, e) is 
right-handed, ie, the thumb, the index finger and the middle finger of 
the right hand point along the directions of a, b and e, respectively. 
Computing the scalar product of a X b and c, we obtain 


(a X b)-c = (Se)-c = S(e-c) = S projec. if 


The number S projec is equal to the positive height Л of the parallele- 
piped if the angle y between e and c is acute, i.e, if the ordered triple of 
vectors (a, b, c) is right-handed, and to the negative height if the angle 
€ is obtuse, i.e., if (a, b, c) is the left-handed ordered triple of vectors, so that 


(a x b):)e = +Sh = xV. 


Therefore the scalar triple product of a, b and c is numerically equal 
to the volume V of the parallelepiped with edges a, b, c when the triple 
(a, b, c) is right-handed, and to minus V when the triple (a, b, c) is left- 
handed. 

The geometry reveals that multiplying the three vectors in any other 
order, we obtain either +V or — V. 

Notice that when the triple (a, b, c) is right-handed, both (b, c, a) and 
(c, a, b) are also right-handed, while all the triples (b, a, c), (a, c, b) and 
(c, b, a) are left-handed. Thus 


(a x b)-c = (b x c)-a = (сх a)-b 
= —(b*a)c- —-(axc)b- – (сх b)-a. 


Once again we shall emphasize the following: the scalar triple product 
of three vectors a, b, c is equal to zero if and only if a, b and c are coplanar. 
In symbols, 
a, b, c are coplanar iff (a x b)-c = 0. 


Scalar triple product of three vectors specified by their coordinates. Let 
a= (x, y1, Z1}, bD = (22,2, 22] ande = fx, ys, 23) be three vectors speci- 
fied by their coordinates with respect to the orthonormal basis (i, j, k). 
Let us express the scalar triple product (a X b)-c in terms of coordinates 
of a, b and c. 


We have 
ij К 
axb-j|x л а 
x» у» 2 


їл — уд) + jz – x12) + КОпу- Хул). 
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Whence 
(a X b)-c 
xi Ул 41 
JA 4 xi 4 л n 
= хз x ын -13 p» 201. 
» 2 2 22 2 № X3; » 23 




















Thus the scalar triple product of three vectors specified by their coor- 
dinates with respect to the orthonormal basis (i, j, k) is equal to the third- 
order determinant whose rows are coordinates of the vectors being mul- 
tiplied. 

The necessary and sufficient condition for three vectors a= 
(м, Эл, 1], b = fx», р, 22) and с = (хз, уз, 23) to be coplanar may be 
written as 


xi à 2 
ою » 2] = 0. 
Хз уз Z3 


Example. Verify whether the vectors a = (7, 4, 6}, b = (2, 1, 1} and 
c = (19, 11, 17} are coplanar. 
ча The vectors are coplanar if 


7 4 6 
A=]2 1 1 
19 1] 17 


is equal to zero, and noncoplanar otherwise. 
Expanding A with respect to the first row, we obtain 


A=7xX6-4x15+6x3=0. 


Thus we conclude that the vectors are coplanar. В» 

Vector triple product. The vector triple product a х (b х c) of three 
vectors a, b and c is the vector perpendicular to both a and b x c. Hence 
the vector a x (b x c) lies in the plane determined by b and c and may 
be expanded with respect to b and c. It is easy to show that 


ax(bxc)-b-(a:c)—c-(a-b). 


Exercises 


1. The vectors AB = с, BÓ = а and CA = b are sides of a tri- 
angle Express the medians of the triangle in terms of a, b and c. 
2. Let p, q and p + q be three vectors applied to a common point. 
Find the relationship between p and q provided that p + q bisects the angle 
between p and q. 
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3. Compute the lengths of the diagonals of a parallelogram with sides 
а-5р-24 and Ь=р-— 3q provided that |p| = 2V2, || = 3 and 
(p, q) = 7/4. e 

4. Prove that the diagonals of the rhombus are perpendicular to each other. 
5. Compute the scalar product of a = 4i + 7j + ЗК and = 3i — 5j + К. 
6. Find the unit vector a^ which is parallel to a = (6, 7, —6}. - 

7. Find the projection of the vector a = i + j — k onto the axis determined 
by the vector b = 2i — j — 3k. 

8. Find the cosine of the angle between the vectors AB and AC, given 
А(-4, 0, 4), B(— 1, 6, 7) and СО, 10, 9). | 

9. Find the unit vector p^ which is perpendicular both to the vector а = 
(3, 6, 8} and to the x-axis. 

10. Find the sine of the angle between the diagonals of the parallelogram 
with sides a = 2i + j - k and b =i — 3j + К. 

11. Find the height A of the parallelepiped with edges a = 3i + 2j — 5k, 
b = i — j + 4k and c =i — 3j + k provided that the base of the parallele- 
piped is the parallelogram with sides a and b. 


Answers 





1. AM = ¢ + 5 or AM = £5"; BN -a+2 or BN = *=*; C = 
b+ = or CP = LM 2. |p| = |q| since the diagonal of the parallelogram bisects the 
angle only if the parallelogram is the rhombus. 3. |a + b| = 15, |a — b| = V593 . 5. —20. 

Bue 7. cue. o= 6 _1 Æ касыз ы 

6. a [i 11” 8| or a { 11” 117 s). 7. proj a Jn 
4 3 : 248 49 

8. cose = 1, е = 0. 9. p? = +10, --,-1.10 sing = <=. 11. h = —__., 

Ци k { 5'5 ] E 273 4323. 





Chapter 3 


The Line and the Plane 


3.1 The Plane 


Normal and general equations of a plane. We may determine a plane 
in space by specifying, relative to a given point O, two quantities, namely 
(1) the perpendicular distance p between O and the plane, i.e, the length 
of the perpendicular OT dropped from O onto the plane, and (2) the unit 
vector n? which starts at O and is perpendicular to the plane, as shown 
in Fig. 3.1. 





Fig. 3.1 


Suppose that an arbitrary point M moves in the plane v. Whatever the 
— . ». 
path of moving, the position vector r = OM varies so that the relation 


proje OM = p (3.1) 


Temains valid unless M leaves the plane. In other words, relation (3.1) 
describes a property peculiar to all points of the plane, i ie., ciation (3.1) 
is the equation of a plane. Observing that proj,,OM = r-n?, we may 
write (3.1) as 


r-n? — р = 0. (3.2) 


Equation (3.2) is called the normal vector equation of a plane. Тће posi- 
tion vector r is sometimes referred to as a moving position vector. 


48 3. The Line and the Plane 


Let us set up in space a Cartesian coordinate system with the origin 
at О. Then n? = (cosa, cos В, cos у) and r = (x, у, 2), and the equation 
(3.2) takes the form 


xX cosa + ycos8 -zcosy — p = 0, (3.3) 
in which case 
(1) cos? а + cos? 8 + cos? у = 1, 


(2) the term (—p) is nonpositive. 

Equation (3.3) is called the normal coordinate equation of a plane. 

Since (3.3) is the first-degree equation in x, y and z, we conclude that 
in three-dimensional space a plane is determined by the first-degree equa- 
tion in Cartesian coordinates x, y and z. 

The converse is also true, namely any first-degree equation in Cartesian 
coordinates x, y and z defines a plane. 
“4 Indeed, let us consider a general first-degree equation 


Ax + By + Cz+D=0 (A? + В? + С? > 0). (3.4) 
Multiplying (3.4) by a scalar д, we obtain 
Ах + By + Cz + ир = 0. (3.5) 


We choose и such that (3.5) reduces to the normal equation (3.3). It 
suffices to set 


pA=cosa, pB=cos8, pC=cosy, uD = -p. (3.6) 
From (3.6) we obtain 
uA? + В? + С?) = 1. 


Whence 
1 


VA? + В? + С? 


From aD = -p < 0 it follows that the sign in (3.7) must be opposite 
to the sign of D. Substituting (3.7) into (3.6), we find expressions for cos a, 
cos B, cos y and p in terms of the coefficients of equation (3.4). In other 
words we reduce equation (3.4) to the normal coordinate equation (3.3). 
The scalar y is said to be a normalizing factor. Since the normal coordinate 
equation defines a plane, so does equation (3.4), called the general equation 
of a plane in three-dimensional space. 9» 

Therefore, any first-degree equation in x, y and z defines a plane in 
space as a set of points whose coordinates satisfy this equation. 

We shall call any nonzero vector perpendicular to a given plane the nor- 


к= + (3.7) 
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mal vector to this plane. This implies that the vector п = (A4, B, C] is 
a vector normal to the plane (3.4). Hence, A, B and C are easy to interpret 
as coordinates of the vector n = (A4, B, С} normal to the plane (3.4) (see 
Fig. 3.2). Any other normal vector is obtained by multiplying n by a non- 
zero scalar. 


п= (А, 8,6} 









Агт+Ву+Сг+0= 0 


Fig. 3.2 


Equation of a plane which passes through a given point and is perpen- 
dicular to a given direction. We wish to find an equation of a plane which 
passes through a point Mo specified by the position vector го = 
(Xo, Yo, Zo} and is perpendicular to the vector п = (A, B, C}. 

Let r = (x, y, z) be a position vector of an arbitrary point M of a 
plane (Fig. 3.3). The vector MM = r — ro lies in the plane and, conse- 
quently, is perpendicular to the vector n. So the scalar product of the vec- 
tors г — ro and n is equal to zero, i.e., 


(т — ro):n = 0. (3.8) 
The identity (3.8) remains valid for all points of the plane and does 


not hold for any point outside the plane. Hence, (3.8) is the normal vector 
equation we are seeking for. Expressing (3.8) in terms of coordinates of 





4— 9505 
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т — ro and n, we obtain the desired equation 
A(x — xo) + BO — yo) + CCR – zo) = 0. (3.9) 


The angle between two planes, we consider two planes specified by the 
equations 


Aix + By + Ciz £ Dj 20 (Ai + В + Сү > 0), 
Азх + Ву + Од + D = 0 (А2 + В? + С? > 0). 


The angle between two planes is any of the two dihedral angles formed 
by these planes (Fig. 3.4). When two planes are parallel the angle between 
them is equal either to zero or to т. 





Fig. 3.4 


We easily see from Fig. 3.4 that one of the two dihedral angles is equal 
to the angle ф between the vectors ш = (4i, Bi, Ci] and m= 
(42, В, C2), whence we have 


TUNES m:m — A142 + В.В, + СС (3.10) 


юг · || А + В? + С? МА? + В + С? | 


Conditions of perpendicularity and parallelism of two planes. If two 
planes are perpendicular to each other, their respective normal vectors are 
also perpendicular. Hence, there holds 

n; ° п = 0 
ог 
A142 + В.В + СС; = 0. 


This is the condition of perpendicularity of two planes. 
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If two planes are parallel, their respective normal vectors are collinear, 
ie, n; = Am; Expressing this identity in terms of coordinates of n; and 
п, we obtain 


A = №, В = №, С = № 
Or 


This is the condition of parallelism of two planes. 
Problem. Find the equation of the plane passing through three noncol- 
linear points Mi(Qa, yi, 21), M2, y», 22) and M3(x3, уз, 23). 


3.2 Straight Line in a Plane 


An approach identical to that we used in preceding sections is fully 
applicable when we consider a (straight) line in a plane. 
Here we present the major results with reference to accompanying 
figures. 
1. Standard forms of the equation of a line: 
(a) The normal vector equation (Fig. 3.5) 


гэв?-р-0, 





Fig. 3.5 Fig. 3.6 


(b) The normal coordinate equation (Fig. 3.6) 
xcosg + ysing – р = 0. 
(c) The general equation (Fig. 3.7) 
Ах + Ву+С=0 (А? + В? > 0), 
where А and В are coordinates of the vector n = (A, B) normal to the line. 


4* 
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Multiplying the general equation by 
1 


к= £ — 
VA? +B? 


we may reduce the general equation to the normal coordinate equation. 






Ax + Ву+С= 0 





Fig. 3.7 Fig. 3.8 


The sign of » is chosen so that 
pC = -p <0. 
We mention two interesting cases: 
(a) If B #0, then setting k = — 4 and b = - $ (see Fig. 3.8), we 
obtain the s/ope intercept form of the equation of a line 
у= Кх +b. 
(b) If ABC эе 0, then setting a= – С and b= – © (Fig. 3.9), we 


obtain the intercept form of the equation of a line 
РЕ 
+ b 1. 


а 





Fig. 3.9 Fig. 3.10 
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2. The distance between the point M* and the line L is the length of 
the perpendicular segment M*N dropped from M* onto L (Fig. 3.10). 
Let M*(x*, y*) be a given point and 
xcosy+ysing-p=0 
be a normal coordinate equation of L. Then the distance between M* and 
L is 
d = d(M*, І) = x*cos e + y*sine — pl. 
< Letr bea position vector of an arbitrary point M on L. Then г: n? = p. 
We denote the position vector of M* by r*. The difference r* - n? — р 
is equal to the projection of the vector r* — r onto the axis Z* determined 
by n°, ie, 
r* n! — p=r*-n® — ren? = (r* — г): n? = projr«(r* — г), 


as shown in Fig. 3.11. 


yl 














Fig. 3.11 


Finding the absolute value of the above identity, we obtain 
[r* -n° — p| = d (M*, L) 
or, in terms of coordinates, 
d(M*, І) = x* cose + y* sing — pl. ® 
If L is specified by the general equation 
Ax + By + C = 0, 
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the distance between M* and L is 
|Ax* + By* + C| 
Remark. Similarly, we may define the distance between a point and a 
plane. If a plàne is given by the general equation 
Ах + By + Cz + р = 0, 


d(M*, 1) = 


then the distance between the plane and the point M*(x*, y*, z*) is 
jas |Ax* + By* + Cz* + D| 


VA? + В? + С? 


d(M*, т 





Fig. 3.12 Fig. 3.13 


3. The equation of a line which passes through a given point 
Мо (х, yo) and is perpendicular to a given normal vector n= 
(A, B) #0 (Fig. 3.12) is 


A(x — х) + B(y — yo) = 0. 
4. The angle between two lines Aix + Biy + Cy = 0 and Ax + 
Ву + С = 0 (Fig. 3.13) is given by 


A142; + B1B; 


VAI + By -VA2 + Bz 
The two lines are 
(a) perpendicular to each other iff 
A142 + BiB = 0, 
(b) parallel iff 
А 8 


“Аз B 


cos ¢ = 
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3.3 Straight Line in Three-Dimensional Space 


Equation of a line. The line is uniquely determined in space by a 
position vector ro of a point Mo belonging to the line and a nonzero vector 
s parallel to this line. The vector s is called the direction vector of the line. 


Let r= OM bea position vector of an arbitrary point M on the line. 
From Fig. 3.14 we easily see that 


OM = ОМ, + MoM. (3.11) 


The vector MoM is parallel to the vector s so that MoM = st, where 


t is a scalar whose value depends on the position of M on the line. Hence, 
(3.11) may be written as 


т = ro + Sf. (3.12) 


Equation (3.12) is called the parametric vector equation of a line. 





Fig. 3.14 


We set up a Cartesian coordinate system with the origin at O. Let us 
denote the coordinates of Мо, i.e., coordinates of ro, by xo, уо, 20: the coor- 
dinates of an arbitrary point M, i.e., the coordinates of г, by x, y, z; and 
the coordinates of the direction vector s by /, т, n. Then the vector equa- 
tion (3.12) becomes 


X — Xo t lf, у= yo + mt, z— zo + nt. (3.13) 


Equations (3.13) are called the parametric equations of a line. The num- 
bers /, m and n are referred to as direction numbers of a line. 
Eliminating t from (3.13), we obtain 


ео Ус 21-20 _ 
1 т п 
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Whence 
X-% y-y | -% 











7 т P EX (3.14) 


where Xo, уо, zo are the coordinates of Mo belonging to the line and /, т, 
п are the direction numbers (Ê + m? + n? > 0). 
Relation (3.14) is said to be the point direction equation of a line. 
Any two equations obtained from (3.14), say 
х-%_у-% ad 220. - 
I m m n 
define a line as a line of intersection of two planes. 

Equation of a line passing through two given points. We wish to find 
equation of a line passing through the points Mi(xi, yi, zı) and 
M202, Y2, 22). Suppose we are seeking for the point direction equation of 
a line. So we need to know the coordinates of any point belonging to the 
line and the direction vector. Let us select Mi(x1, Ул, 21) as the point we 








need and define the direction vector as М.М = (3 —x, »3- Xi, 
22 — 41}. Then, the desired equation is 
х-хХ - - 
iurc NC EN (3.15) 
x2 — x Jz = у 20-12 
Remark. If Mı and M: lie in the xy-plane, i.e., if zı = z = 0, the equa- 
tion of a line passing through М, and М» is 








x-x»n у-и 


0 — M »-» 








Example. Find the equation of a line passing through №. (1, 0, —1) and 
М(3, 1, 1). 
ча Using (3.15), we obtain the desired equation as 

х-1 y 2+1 
poc» 

General equation of a line. Reducing the general equation to the point 
direction equation of a line. A line in space may be determined as a line 
of intersection of any two distinct nonparallel planes. 

Let two distinct nonparallel planes be specified by the equations 
Aix + Biy + Ciz + Di = 0 and Ах + y + Оох + D: = 0. 

Then the system of the two equations 


Ded pim Cız + Dı = 0, 
Ах + Ву + Qz + D = 0 


is said to represent the general equation of a line. 








(3.16) 
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System (3.16) may be reduced to the point direction equation of a line. 
This reduction requires a point belonging to the line and a direction vector 
be known. 

We may find the coordinates of a point belonging to the line from (3.16) 
by setting one unknown equal to an arbitrary value and solving the system 
of two equations thus obtained to get the other two coordinates of the 
point we are seeking for. Notice that the direction vector s being of the 
same direction and lying in the line of intersection of the given planes must 
be perpendicular to the normal vectors n; = (4i, Bi, Ci} and m = 
14», B2, C}. Conversely, any vector perpendicular to n; and m is parallel 
to both planes, ie, this vector is parallel to the line of intersection of the 
planes. The vector product n; X n; is known to be perpendicular both to 
nı and m. Hence, we may take s = m X m as the direction vector of the 
plane. 

Example. Reduce 


dp = 0, 


x+y-2z+1=0 б) 


to the point direction equation of a line. 
ч We find a point belonging to the line (3.17). Setting, for example, z = 0, 
we arrive at the system 


х- у= 3, 
х+у= –1. 


Whence, x = 1 and y = -2. Thus М(1, – 2, 0) belongs to the line. 9 





The normal vectors to the planes are m= (l, —1, 1} and 
№ = {1, 1, —2}. Then the direction vector of the line is 
i i k 
s-nmXm-l|l1 -1 1| =i + 3j + 2k. 
1 1 -2 
Hence, the point direction equation of a line specified by the system 
(3.17) is 
x-1 у+2 Z 
aE tS » 
1 3 2 
Angle between a line and a plane. Let there be given a line 
db 190.1 у= ую. 0 (3.18) 
1 т n 
and a plane 


Ах + By + Cz + р = 0. (3.19) 
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The angle ¢ between the line and the plane is the smallest angle between 
the line and its projection onto the plane (Fig. 3.15). 
Let а be an angle between the line (3.18) and the vector normal to the 
plane (3.19). Then 
nes 


cosa = 
In| 8] 








where s is the direction vector of the line (3.18) (see Fig. 3.16). 


23 


Fig. 3.15 


Observe that |cos а] = sing. Then 
: |Al + Bm + Cn| 
sin o = ——— ——————————————. 
VA? + В? + С? . МР + т? + п? 


When the line (3.18) is parallel to the plane (3.19), the direction vector 
s of the line is perpendicular to the vector n normal to the plane so that 


(3.20) 





n-s=0 
or 
Al + Bm + Сп = 0. (3.21) 
1. 
7) 
Л 
п 
sin? --cos d sin? =cos q 
Zeger o<a< Z 


2 
Fig. 3.16 
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This is the condition of parallelism of a line and a plane. 
When the line (3.18) is perpendicular to the plane (3.19), the vectors 
s and n are parallel, ie, s 1 n and 


==, (3.22) 


A.B.C 
il m n 


This is the condition for a line and a plane to be perpendicular. 
Intersection of a line and a plane. Let there be given a line 


х- х _у-% _-%@ 





1 P Я (3.23) 
and a plane 
Ах + By + Cz + Р” = 0. (3.24) 


The coordinates of the point at which the line (3.23) and the plane (3.24) 
intersect are to satisfy both (3.23) and (3.24). Thus, we must solve the sys- 
tem containing (3.23) and (3.24) in three unknowns x, y, z to obtain the 
point at which the line and the plane meet. 

We reduce equation (3.23) to the parametric equations 


X — Xo t [f, у= yo + mt, z — zo 4 nt. (3.25) 
Substituting (3.25) into (3.24), we obtain 
Ах + Ву + Со + D + t(Al + Bm + Сп) = 0. 


Whence 
Ax + Ву + Со + D 
Al + Bm + Сп 


Substituting (3.26) into (3.25), we obtain the coordinates of the desired 
intersection point. 

If Al + Bm + Cn = 0 and Ax + Byo + Cz% + D # 0, the line (3.23) 
is parallel to the plane (3.24) and the point (xo, yo, zo) does not belong 
to the plane. Hence, the line (3.23) and the plane (3.24) have no points 
in common. 

When Al + Bm + Cn = 0 and Ах + В» + Со + D = 0, the former 
equality implies that the line and the plane are parallel to each other and 
by virtue of the latter equality the point (xo, yo, zo) belongs to the plane 
(3.24). Hence, the line (3.23) is contained in the plane (3.24). 

Example. Find the point of intersection of the line 


(Al + Bm + Cn # 0). (3.26) 


t=- 


pel wi nie (3.27) 
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with the plane 


ax+y+7z-3=0. (3.28) 
-4 We reduce (3.27) to the parametric equations 
х-7-3, y=3tt, z= -1-2t. (3.29) 


Then substituting (3.29) into (3.28), we obtain 


6 + 144+44+3-144-7-3=0 
or 
—-7t+7=0. 


Whence, | = 1. 

Now we get from (3.29) the coordinates x = 10, y = 4, z = —3 of the 
intersection point. Р» 

Problem. Find the distance between the point Mi(xi, Ул, zı) and the line 


Exercises 


1. Write the equation of (a) the plane which is parallel to the 
xz-plane and passes through the point (2, —5, 3); (b) the plane which con- 
tains the z-axis and passes through the point (—3, 1, —2). 

2. Given the points A(1, 3, —2) and B(7, —4, 4), write the equation of 
a plane which passes through B and is perpendicular to the segment AB. 
3. Write the equations of (a) the plane which passes through the point 
(-2, 7, 3) and is parallel to the plane x — 4y + 5z — 1 = 0; (b) the plane 
which passes through the origin of coordinates and is perpendicular to the 
planes 2x — y + 5z + 3 = 0 апі х + 3у – 2 – 7 = 0. 

4. Write the equation of the plane, such that the point P(3, — 6, 2) is the 
foot of the perpendicular dropped from the origin of coordinates onto the 
plane. 

5. Write the equation of the plane passing through the origin of coordinates 
and the points A(3, —2, 1) and В(, 4, 0). 

6. Write the equation of the line passing through the origin of coordinates 
and the point (a, b, c). 

7. Compute the angle between the lines 


х-1 yt2 z-5 х _ y-3. 241 
3 6 2 2 9 6 ` 
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(The angle between two given lines is that between any two lines which 
pass through an arbitrary point in space and are parallel to the givén lines.) 
8. Reduce the equations of the line 


peu ie 
х-2у+@+3=0 
to the point direction equation. 


9. Write the equation of the line which passes through the point 
(2, -5, 3) and is parallel to (a) the z-axis, (b) the line 


(c) the line 


2x-y+3z-1=0, 
5х + 4у - 4-7 = 0. 


10. Write the equation of the line which passes through the point 
(1, —1, 0) and is perpendicular to the plane 2x — 3y + Sz- 7 = 0. 
11. Find the point of intersection of (a) the line ЕН - 2-4 = 47° 


with the plane 3x — y + 22 — 5 = 0; (b) the line х+1 Нела 





with the plane 3x – 3y + 2z – 5 = 0. 
12. Write the equation of the plane which passes through the origin of 
coordinates and is perpendicular to the line x i 2 = x = i 





13. Find the projection of the point A(4, —3, 1) onto the plane 
xt2y-z-3-0. 

14. Write the equation of the plane which passes through the point 
(4, —3, 1) and is parallel to the lines 





Rr NNI n хал JO гэ 

“бий ил M лийр Явц a 
15. Write the equation of the plane which passes through the line 
х-3 _у+4 2-2 . f x+5 _ 
9 ерта and is parallel to the line 1 
у-2 2-1 
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Answers 


1. (a) у + 5 = 0; b) x + 3y = 0. 2. 6х — Ty + 6z – 94 = 0. 3. (а) x - 4y + 5z + 
06 0) 2x-y»-z20 4. 3х - бу + 22 – 49 = 0. 5. 4х - у – 142 = 0. 





ху „© a шу „ЖӨ. int lyi 
6 => 2 . 7. cos р 7 . 8 9 < = ; provided that the point lying on the 
+ 


y+5=0 4. 
Ju on (с) x-2 у+5 _ 4-3 10. х-1_у+1_4@ 
-6 9 -11 17 13 2 -3 5 


line is (0, 0, —3). 9. ate MESES A А8 Ea 2 or Hera w -= 























11. (a) (2, 3, 1); (b) the line is parallel to the plane. 12. 4x + 5y — 2z = 0. 13. (5, -1, 0). 
14. 16x — 27у + 142 — 159 = 0. 15. 23x – 16у + 104 – 153 = 0. | 


Chapter 4 


Curves and Surfaces of the Second Order 


4.1 Changing the Axes of Coordinates in a Plane 


Suppose that we are given two Cartesian coordinate systems Oxy 
and O'x'y' in a plane (Fig. 4.1). We may indicate the location of an ar- 
bitrary point M in a plane by giving its coordinates (x, y) on the x- and 
y-axes or its coordinates (x', y’) on the x’- and y’-axes. It is clear that 
the coordinates (x, y) and (x', y’) are somehow related. We shall try to 
ascertain relationships between (x, y) and (x', y') and, consequently, those 
between the axes of the two Cartesian coordinate systems in a plane. 


y 
, x! 


Fig. 4.1 


Translation of the axes of coordinates. Suppose that the Cartesian coor- 
dinate system O’x’y’ is obtained by moving the x- and y-axes parallel to 
themselves so that the origin of coordinates O goes into the point O’ 
(Fig. 4.2). This means that the unit vectors of the parallel axes are 
equivalent. 





Fig. 4.2 
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Let r and r' be position vectors of a point M relative to the origins 
О and О’ of the given Cartesian coordinate systems Oxy and O'x'y' 
(Fig. 4.3). Then we have 


r= xi + yj, 
r'-x'i* yj 
and 
OÓ' = ai + Bi, 
where а and f are the coordinates of the origin О” on the x- and y-axes, 
respectively. 
Since 
r=r’+ 00”, 
we obtain 
xi + yj = (x'i + y'i) + (oi + Вр). 
Whence 
x=x' +a, 
у=у' +8. 





Fig. 4.3 Fig. 4.4 


Rotation of the axes of coordinates. Suppose that the x'- and y'-axes 
are obtained by rotating the x- and y-axes through an angle ¢ so that the 
origins of the two Cartesian coordinate systems coincide (Fig. 4.4). 

We find coordinates of the unit vectors i’ and j’ relative to the Cartesian 
coordinate system Oxy (Fig. 4.5). It is easy to see that the coordinates of 
the unit vector i’ are the cosines of the angles р and 5 — ф made by the 
unit vector i^ with the x- and y-axes, respectively. Thus, we may write 


i’ = ісоѕе + ј sin e. 
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Similarly, the coordinates of the unit vector j^ are the cosines of the 
т 

2 
1' = -ising + j cosg. 


angles o + and e so that 


Since for an arbitrary point M its position vectors r = xi + yj and 
r' — x'i' + y'j' are equivalent, i.e., 
x tyjb-x'i' + y’j’, 
then substituting the expressions for i’ and j’ into the above identity, we 
obtain 
xi + yj = x' cose + j sing) + y’(—ising + j cosy) 
= (x' cos e — y’ sin p)i + (x' sing + y’ cos o)j. 





Fig. 4.5 Fig. 4.6 


Whence 
х= x' cos e — у’ sing, 
y=x' sine + y' cos е. 
Reflection of the axes of coordinates. Suppose that the Cartesian coor- 
dinate system O'x'y' is obtained from the system Oxy by reversing the 


y-axis as shown in Fig. 4.6. Then for an arbitrary point M its coordinates 
(х, у) and (x', у’) are related as 


x'-x and y’=--y. 


5—9505 
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Fig. 4.7 


Any change from one Cartesian coordinate system to another one with 
the same unit distance may be made using in succession the translation, 
rotation and reflection of the axes (Fig. 4.7). 


4.2 Curves of the Second Order 


Suppose that we are given a Cartesian coordinate system in a plane. 
A locus of points of a plane whose coordinates satisfy the equation 


F(x, y) - 0, (*) 


where F(x, y) is a function of two variables, is called a plane curve. The 
equation (*) is said to be the equation of a plane curve. 

For example, the equation x + y = 0 is the equation of a line which 
bisects the second and the fourth quadrants of the coordinaté plane 
(Fig. 4.8), and the equation x? + y? — 1 = O is the equation of a circle with 
unit radius and centre at the origin of coordinates (Fig. 4.9). 

We consider the quadratic polynomial of two variables x and y 


F(x, у) = Ах? + 2Bxy + Cy! + 2Dx + 2Еу + Е 
(A? + В? + С? > 0). 





Fig. 4.8 
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The equation 
F(x, у) = 0 


is said to be the equation of a curve of the second order. 

While curves of the first order are straight lines (and only straight lines), 
the quadratic polynomial equation provides a variety of curves of the se- 
cond order. So it is helpful to precede our study of the general equation 
of a curve of the second order with an investigation of some important 
specific plane curves of the second order. 


4.3 The Ellipse 


The ellipse is a plane curve defined by the Cartesian equation 
2 2 

Lue a 
b 


where a > b > 0. 
Equation (4.1) is called the standard Cartesian equation of the ellipse 


and the associated Cartesian coordinate system is called the standard Carte- 
sian coordinate system. 


= 1, (4.1) 





Fig. 4.10 


The circle 
х + у? = а? (4.2) 


is an еШірѕе with а = b. This enables us to regard the ellipse (4.1) as a plane 
figure obtained by uniformly compressing the circle (4.2) with coefficient 
b/a towards the x-axis (Fig. 4.10). In other words, equation (4.1) of an ellipse 
is obtained by substituting (@/b)y for y in the equation х? + у? = a’. (The 


us 
I 
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uniform compression of a circle towards the x-axis with coefficient k > 0 
is a transformation which sends an arbitrary point M(x, y) on the circle 
to the point M'(x, y/k).) 


Properties of the ellipse. (1) The ellipse (4.1) is contained in the rectangle 
P= {(х, у): | <a, |у|< 5). 


ча This property is easy to verify since 


2 2 
LINDE and xd 
a b 


for any point M(x, y) lying on the ellipse. » 
The points (+a, 0) and (0, +b) are called the vertices of the el- 
lipse (Fig. 4.11). 
| 











(Хос) e- -——-— Б- - (000) 
РА l 
"ui 
2710 | ЕЗ 
em id ! 
27 | 
e 4 
(-X0r yo) (Toryo) 
Fig. 4.11 Fig. 4.12 


(2) For the ellipse (4.1) the x- and y-axes of the standard Cartesian coor- 
dinate system are the axes of symmetry and the origin O of coordinates 
is the centre of symmetry. This means that when Mo(%, yo) belongs to 
the ellipse, so do the points ( — xo, уо), (— хо, —yo) and (хо, —yo) (Fig. 4.12). 

(3) If a » b, that is, unless the ellipse is a circle, the x- and y-axes of 
the standard Cartesian coordinate system are the unique axes of symmetry 
of the ellipse (4.1). 

Set c = Va’ — b? . It is easy to see that c < a. The points (—c, 0) and 
(c, 0) are called the left-hand and right-hand foci of the ellipse. The distance 
between the foci is equal to 2c. 

(4) The ellipse is a locus of points of a plane such that the sum of 
the distances of these points from two given points (from the foci of the 
ellipse) is constant (is equal to a given number) (Fig. 4.13). 


. 43 The Ellipse _ 5 ici o edt ME ead Drs cae doe 


+4 Let М(х, y) be a point lying on the ellipse 


X Ur e. 


I» dum 


a р? 
The distance from M(x, y) to the left-hand and to the right-hand foci 
of the ellipse are 


о= М(х + с) + у? and о -N(x-coy + у>. 
9 
22. 
OMNE S 
Fig. 4.13 


By the substitution 


2 
»-»( -=) 


we easily obtain 












p? 
Qi = (1- m)g rene 





because |x| < a and = <l. 
Similarly, we obtain 
Or = а — = x. 
Adding ог and ог, we have the desired result 
Qi + о, = 2a. 


Notice that we have proved in Chapter 1 (see Section 2, Exercise 2) that 
any point possessing this property lies on the ellipse. 9» 
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The number e =< is called the eccentricity of the ellipse (4.1). It is 


easy to see that 0 « e « 1, and e = 0 in the case of a circle. 
The lines 


x4£ 20 and x-2 =0 
e e 


are called the directrices of the ellipse (4.1) (Fig. 4.14). Any ellipse has two 
directrices. 

(5) The ellipse is a locus of points of a plane such that for any point 
of the locus the ratio of its distance from a given point (from the focus 
of the ellipse) to its distance from a given line (from the directrix lying 
on the same side of the y-axis as the focus) is constant (is equal to the 
eccentricity of the ellipse). 





Fig. 4.14 Fig. 4.15 


ча Let M(x, y) be a point on the ellipse (4.1) (Fig. 4.15). The distances 
from M(x, y) to the right-hand focus and to the right-hand directrix are 
ог= а – ех and dq == — x. 
Whence we obtain the desired result 
Ог 


d; = е. 





Similarly, we obtain 
о _ а+ех — 
зе» 
С а 
Now we consider the point (c, 0) and the line х= (с = ае). 


The distance from the point M(x, y) to the given point (c, 0) and to 


4.4 The Hyperbola ар. NOU 1 








the given line у =< are 


N(x— cy + y? and 


respectively. 
We require that 


М(х = с)? + у? _ 
@ ж 
е 








е. 


Тһеп 
“(х — cy + у? = Ja - ex]. 

Squaring the above identity and setting b? = а? — c?, we easily obtain 
x 2 
LCD 2 
a b 


Thus the point M(x, y) belongs to the ellipse (4.1). > 


= 1. 


4.4 The Hyperbola 
The Ayperbola is a plane curve given by the Cartesian equation 


è y 

LC sm], 4.3 

z y (4.3) 
where a > 0 and b> 0. 

Equation (4.3) is called the standard Cartesian equation of the hyperbo- 
la and the associated Cartesian coordinate system is called the standard 
Cartesian coordinate system. 

Properties of the hyperbola. (1) The hyperbola (4.3) lies outside the strip 
|x| < a (Fig. 4.16). 

48 It is easy to see that for any point lying on the hyperbola there holds 
x у? 
=^ = 1+2 >21. 
а? Ги 
Therefore |х| > a for any point lying on the hyperbola. 9» 
The points (xa, 0) аге the vertices of the hyperbola. 
(2) The hyperbola (4.3) lies in the interior of the vertical angles formed 


by the lines y = +? x, the x-axis being the Lisector of these angles 


(Fig. 4.17). 
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ча From the inequality 
x у? 
a" 2E p 
it follows йг for any point M(x, y) lying on the hyperbola 
р| <— P ix 


holds. » 





Fig. 4.16 Fig. 4.17 


Thus the hyperbola contains two parts called the branches of the 


hyperbola. 
The lines 
x i2 =- X 35 . 
2 + b 0 and 2 b =0 


are called the asymptotes of the hyperbola. 

(3) The hyperbola contains points infinitely distant from the origin O 
of coordinates. 
ча For example, let M(x, y) be a point on the hyperbola and let |у| = n, 
where л is an arbitrary positive number (Fig. 4.18). 

Then 


2 
_ | y аа 
|x| = а du > рр] Б" 


z tbl „a+b p 
r+y > 2 mn 3 





and 


Let us establish relations between points lying on the hyperbola and 
points belonging to the asymptotes of the hyperbola. 
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We choose a point M lying on the hyperbola (4.3) in the first quadrant 
of the standard Cartesian coordinate system and a point N belonging to 


the asymptote х - I = 0 and having the same abscissa as M (Fig. 4.19). 


These points are 


2 END and N(x, 23). 
a a 





Fig. 4.18 Fig. 4.19 


The distance between the points M and N is 
d(M, ю-2( -48-2) 
Multiplying and dividing d(M, М) by x + Ух2 + a? , we obtain 


ab 
х + Ух. – а? 


Passing to the limit we conclude that d(M, N) tends to zero as x tends 
to infinity. 

Thus we have proved the following property of the hyperbola: 

(4) If a point recedes along the asymptote from the origin of coordinates 
into infinity, i.e., if x > + co, then there exists a point lying on the hyperbola 
and having the same abscissa as the point on the asymptote so that the 
distance between these two points tends to zero. 

The converse is also true, namely: 

(5) If a point M(x, y) recedes along the hyperbola from the origin of 
coordinates into infinity, ie, if x? + y? > co, then the distance from 


d(M, N) = 





M(x, y) either to the asymptote 


A gel c 

a Ч b : 
or to the asymptote 

Хо ы нг 

а b : 


tends to zero. 

(6) The axes of the standard Cartesian coordinate system are the axes 
of symmetry for the hyperbola (4.3) and the origin of coordinates is its 
centre of symmetry (Fig. 4.20). 





Fig. 4.20 


The axes of the standard Cartesian coordinate system are the unique 
axes of symmetry of the hyperbola. 

Set c = Уа? + Б? . It is easy to see that c > 0. The points (—c, 0) and 
(c, 0) are called the foci of the hyperbola. The distance between the foci 
is 2c. 

(7) The hyperbola is a locus of points of a plane such that for any point 
the absolute value of the difference of its distances to two given points 
(to the foci of the hyperbola) is constant (is equal to a given number). 
ча The proof is similar to that of Property 4 for the ellipse. For example, 
we shall show that each point of the hyperbola possesses this property. Let 
M(x, y) be a point lying on the hyperbola (4.3) (Fig. 4.21). Then the dis- 
tances from M(x, y) to the foci of the hyperbola are 


eg2N(c-o + у> = 





c 
at—x 
a 





and 


e 7 Vx-oh €» = 





с 
4--4Х). 
а 
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Since © > 1, 
a 
с : 
Ох if xa, 
о = 
с : 
-а--х if x& -a 
a 
Fig. 4.21 
and 
с . 
-а+ 2х if xa, 
Qr — 
с . 
a-—x if х< -а. 
a 
Whence 


domes 2a if xa, 
Qe cag if x< -—a. 


Therefore we may write 
ler ол = 2a. > 
The number e = 2 is called the eccentricity of the hyperbola (4.3). It 


is easy to see that e > 1. 
The lines 


хит -0 and х- -0 


aja 


are called the directrices of the hyperbola (4.3). 
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Fig. 4.22 


Any hyperbola has two directrices (Fig. 4.22). 

The following property is similar to Property 5 of the ellipse: 

(8) The hyperbola is a locus of points of a plane such that for any point 
the ratio of its distance from a given point (from the focus of the hyperbola) 
to its distance from a given line (from the directrix lying on the same side 
of the y-axis as the focus) is constant (is equal to the eccentricity of the 
hyperbola) (Fig. 4.23). 





Fig. 4.23 


_45TheParabola ——— — — — —— . БИ 





Fig. 4.24 


The hyperbola 
x у? 
^ -= = -l 4.4 
д? р (60 
is called the conjugate hyperbola of the hyperbola (4.3). Locations of the 
hyperbolas (4.3) and (4.4) are shown in Fig. 4.24. 


4.5 The Parabola 
The parabola is a plane curve defined by the Cartesian equation 
у? = 2px, (4.5) 


where p > 0. 

Equation (4.5) is called the standard Cartesian equation of the parabola 
and the associated Cartesian coordinate system is called the standard Carte- 
sian coordinate system (Fig. 4.25). 


y 





Fig. 4.25 Fig. 4.26 
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Properties of the parabola. (1) The parabola lies in a semiplane to the 
right from the y-axis, ie, x 2 0 for any point lying on the parabola 
(Fig. 4.26). The origin of coordinates belongs to the parabola and is called 
the vertex of the parabola (4.5). 

(2) The parabola contains points infinitely distant from the origin of 
coordinates. . 

(3) The axis of abscissas of the standard Cartesian coordinate system 
is the unique axis of symmetry of the parabola (4.5) (Fig. 4.27). 

The axis of symmetry is called the axis of the parabola. 

The number p is sometimes called the /оса/ parameter of the parabola. 


The point e 0) is the focus, and the line x = — Р is the directrix of 


2:7 2 
the parabola (4.5). 
(4) The parabola is a locus of points of a plane such that for any point 
its distance from a given point (from the focus of the parabola) is equal 
to its distance from a given line (the directrix of the parabola) (Fig. 4.28). 











Fig. 4.27 Fig. 4.28 
* Let M(x, y) be a point lying on the parabola (4.5). The distances from 
M(x, y) to the focus е Ё 0) and from M(x, y) to the directrix x = —Ё are 


N 





respectively. 
Substituting 2px for y?, we easily obtain 
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The converse is also true Let M(x, y) be a point equidistant from the 
point e А o and from the line x = — 2 ie, 


2 2: 
2 
zu ? = 
(12) 


Squaring the above identity, we easily obtain 


2 





х+2). 


у? = 2рх. > 


4.6 Optical Properties of Curves of the Second Order 


Tangents to ellipses and hyperbolas. Let y = f(x) be an equa- 
tion of a plane curve. Then the equation of a tangent to the plane curve 
at a point (xo, yo), where yo = f(x), may be written in the form 


y — yo = f'o) — xo). (4.6) 
Let Mo(xo, yo) be a point lying on the ellipse 
2 2 
x y 
“rtie 
а? Бр? 





Fig. 4.29 


For definiteness we assume that the point Мо lies in the first quadrant 
of the coordinate plane, i.e., xo > 0 and yo > 0 (Fig. 4.29). 
The part of the ellipse in the first quadrant is given by the equation 


y-b к=; 


Using (4.6), we obtain the equation of a tangent to the ellipse at the 
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point Mo(xo, yo) 


xob 
y- y= (Х-%). 


Since the point Мо(хо, yo) belongs to the ellipse, 





2 
х 
у = Б pe 
a 
Whence we have 
xb? 
у- yo 7 - —— (x - x). 
а уо 


The above relation may be presented in the form 


2 2 
хх | o - (3 +) =0. 











a? b? a р? 
Since 
x, ME 
a? b? ? 
we arrive at the equation 
E EE 


This is the equation of the tangent to the ellipse at the point (xo, yo) 
which is independent of the location of Мо in the coordinate plane. In other 
words, this equation is satisfied by the Cartesian coordinates of each point 
lying on the ellipse. 

Similarly, we may easily derive the equation of a tangent to the hyperbo- 
la in the form 

Xo _ J _, 
а? b? i 
the point (xo, yo) lying on the hyperbola. 

Tangents to parabolas. Let x = g(y) be an equation of a plane curve. 
Then the equation of a tangent to this plane curve at a point (xo, yo), where 
Xo = р(уо), may be written in the form 


X — Xo = 8'00 — yo). (4.7) 


Let Мо(хо, yo) be a point lying on a parabola. Using (4.7), we obtain 
the equation of a tangent to the parabola at Mo(xo, yo) in the form 


x — % = m Qr - y») 
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or 2 
уу» — Yo + рхо — px = 0. 

Whence, observing that yo = 2pxo, we arrive at the equation of a tan- 

gent of the form 


у = р(х + ж). 


Remark. Comparing standard Cartesian equations of the ellipse, hyper- 
bola and parabola with the equations of respective tangents we see that 
the latter can easily be obtained from the former. Indeed, substituting yyo 
for у? and x% for x? in the equations of the ellipse and hyperbola, and 
X + Xo for 2x in the equation of the parabola, we immediately arrive at 
the equations of the respective tangents. Notice that the point with coor- 
dinates (xo, yo) lies on a plane curve. 

Optical property of the ellipse. Let Мо(хо, yo) be a point lying on the 
ellipse 

x eee 
а Б? 


Recall that the distance from Мо to the foci F; апа Р, of the ellipse are 


= ]. 


о = lexo + a| and о, = |е — al, 


respectively. 

We draw through Mo a tangent line given by the equation 
XXo УУд 

— + = 1, 

а? b? 











Fig. 4.30 


It is easy to find the distances from the tangent to the foci Fi(—c, 0) 
and F,(c, 0) (Fig. 4.30). We have 


сан and A&-2gx|—--1 


h = 
p а? 








CXo 
, 
2 
X? . XP a a 
where p= | a + pt 15 а normalizing factor. 


6 9505 
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We can easily verify that 


























h_h 
о! Qr. 
Indeed, 
с 
JE 
LPS E Qi eru. 
о lexo + a| а le *a| a 
and 
СЭР 
p 5 
h _ a* ES 
Qr lexo — a| а 
Fig. 4.31 


` Fig. 4.32 


We see from Fig. 4.30 that the above ratios are equal to the sines of 
the angles made by the segments Мо and F,Mo with the tangent. Since 
the sines are equal so are the angles. Therefore a tangent at any point lying 
on an ellipse makes equal angles with the line segments joining this point 
with the foci of the ellipse. This property is sometimes called the optical 
property of the ellipse because the rays from a source of light placed at 
one of the foci are reflected from the “mirror” surface of the ellipse and 
are focused into the other focus (Fig. 4.31). 
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Optical property of the hyperbola. Similarly, we may establish the fol- 
lowing optical property of the hyperbola: the rays from a source of light 
placed at one of the foci seem to emanate from the other focus after being 
reflected from a “mirror” surface of the hyperbola (Fig. 4.32). 


(a) (5) 
Fig. 4.33 
Optical property of the parabola. If a source of light is placed at the 


focus of a parabola the rays from this source are reflected by the “parabolic 
mirror" in directions parallel to the axis of the parabola (Fig. 4.33). 


4.7 Classification of Curves of the Second Order 


We begin our classification with the discussion of plane curves given 
by quadratic polynomials. 
Theorem 4.1. Let us set up a Cartesian coordinate system Oxy in a plane 
and let 


f(x, y) = ax? + 2bxy + cy! + 2dx + 2ey + g 
(а? + b? + с? >0) (4.8) 


be a quadratic polynomial of two variables x and y. 

Then there exists a Cartesian coordinate system O'XY such that sub- 
stituting X for x and Y for y reduces the polynomial f(x, y) to the poly- 
nomial F(X, Y) which may be of the following three kinds: 


(a) AX? + BY? + С, А.В #0; 
(b) BY? + 2DX, B-D#0; 
(с) BY? + E, B ж 0). 


44 Step I. Ву a suitable rotation of the x- and y-axes we may eliminate 
the term 2bxy from f(x, y). 
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Let b #0. 

Consider a Cartesian coordinate system Ox' y' obtained from the origi- 
nal system Oxy by rotating the x- and y-axes through an angle o (Fig. 4. 34). 
The new and original axes of coordinates are related as 

X = x’ cos o — y’ sing, (4.9) 
y = x' sing + y’ cos e. 

Substituting (4.9) into (4.8), we reduce the term 2bxy to the term 

2b'x'y', where 
2b’ = 2(—a sin e cos e  b(cos? p — sin? e) + c sin e cos g) 
= (c — а) ѕіп2ф + 2b cos 20. 

To eliminate 2b'x'y' from the polynomial f(x’, у’) it suffices to set 

2b' 2 0. Whence we have 


а-с 
cot 29 = 





2b 








Fig. 4.34 Fig. 4.35 


Observe that a, b and c are known. From the above identity we may 
find the value e of the angle through which we should rotate the original 
axes of coordinates to eliminate the term 2b'x'y' from the polynomial 
Ах’, y'). In other words, we may always choose а Cartesian coordinate 
system such that the original polynomial is reduced to the form 


f(x, y) = ах? + cy? + 2dx + 2ey + g, 


where a” + c? > 0, and in what follows we shall consider the above poly- 
nomial. For definiteness, we also set c # 0, since substituting y for x, we 
may always reduce the polynomial to a form with c # 0. 
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Step 2. By a suitable translation of the axes of coordinates we may fur- 
ther simplify the polynomial f(x, y). 

Consider a Cartesian coordinate system O' XY obtained from the sys- 
tem Oxy by translating the x- and y-axes (Fig. 4.35) in which case 


X=x+a, 


where —« and — are the coordinates of the origin O’ of the system 
O' XY in the original system Oxy. 

Depending on the relations between the coefficients a, b, c, d and e, 
the polynomial f(x, y) reduces to one of the following three kinds 


(1) a = 0, c = 0. Then setting a =a and 8 = T we obtain 


F(X, Y) = AX? + BY? +C, 
а? е? 
where А = а, В = b, and Свая 


(2 a = 0, d z 0. Then setting 
zs dd _ её? е 
«= 33 (e =) mes 
we obtain 


F(X, Y) = BY? + 2DX, 


where B = c and D = d. 
(3) a = d = 0. Then setting 


а-0 and 8-2 


we obtain 
F(X, Y) = BY? + E, 
2 
where B = c and E=g-—. > 


Standard equations of curves of the second order. We consider plane 
curves of the second order defined by the polynomial equation 


F(X, Y) = 0, 


where F(X, Y) is a quadratic polynomial! discussed in tne preceding 
theorem. | 

Depending on the kind of F(X, Y) the polynomial equation determines 
equations of plane curves of different shapes. Let us consider them 
separately. 
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() F(X, Y) = AX! + BY’? +C=0,A-B#0. 

Assume that 4-8 > 0. Multiplying the polynomial equation by (-1) 
and substituting Y for X and X for Y, if necessary, we may reduce the 
equation to a form with B > А > 0. Then 

(a) If C < 0, we obtain an equation of the ellipse 

2 4 2 
mc + E = 1, 
а Ь 
2-2 c 2 _ 
where а -—<,Ь -$ (а> b > 0). 

(b) If C> Ч we а an equation of the imaginary ellipse* 

X ү? 
PUE E + jo ee 
а? b? 


where 2-6 апа pt (a > b » 0). 


= –1, 


Notice that in the X Y-plane there exists no point whose coordinates 
satisfy this equation. 
(c) If C = 0, we obtain the equation 
x y 
—À + —— = 0, 
а? b? 
where a бе апа pl (a > b > 0). 


The origin of ee is the only point whose coordinates satisfy 
this equation. We may regard the origin of coordinates as a point of inter- 
section of two imaginary lines**. 

Assume that A · В < 0. Multiplying the polynomial equation by (— 1) 
and substituting Y for X and, if necessary, X for Y we may always reduce 
the equation to a form with A > 0, B < 0 and C « O. 

Then 

(a) If C « 0, we obtain an equation of the hyperbola 

x ү І 
а 


С 2 _С 
where a^ = А and b B (а > 0, Ь> 0). 


? This curve is called the imaginary ellipse since its equation resembles the equation of 
the real ellipse. 

*? We speak of imaginary lines since the respective equation resembles the equation 
defining two intersecting lines in a plane. 


4.7 Classification of Curves of the Second Order 87 


(b) If C = 0, we obtain the equation 


2 2 
27254 
а b 
where à? = 1 and b? = – L, 
A B 
This equation defines two intersecting lines 
X Y X Ү 
Z 5 0 and zt’ 
in a plane. 


(2) F(X, Y) = BY + 2DX = 0, B-D #0. 

Assume that B - D < 0. Substituting if necessary —X for X we always 
reduce the polynomial equation to a form with B- D < 0. Then we obtain 
an equation of the parabola 


Y? = 2px, 


where p= - 2 (p » 0). 


(3) F(X, Y) = BY’ -E-0, В = 0. 
Assume that B > 0. Then 
(a) If E « 0, we obtain the equation 


yY,-c-0, 


where c? = — 5 (c > 0). 


This equation defines two parallel lines in a plane. 
(b) If E > 0, we obtain the equation 


Y +c? = 0, 
where c* = 5 (c > 0). 


There is no plane which contains a point whose coordinates satisfy this 
equation, called the equation of two imaginary parallel lines because it 
resembles the equation defining two parallel lines. 

(c) If E = 0, we obtain the equation 


y 50 


which defines two coinciding lines in a plane. 

We may determine what type of plane curve a polynomial equation 
represents without making manipulations as given above. It suffices to de- 
fine the signs of some expressions involving the coefficients of the poly- 
nomial equation in question. 
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Table 4.1 Classification of Curves of the Second Order 








Pair of imaginary intersecting 
lines 


Pair of parallel lines 

0 Pair of imaginary parallel 
lines 
Pair of coinciding lines 





Shape 
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For the polynomial equation 
ах? + 2bxy + су? + 2dx + 2еу + g = 0 


the criteria to determine the type of the plane curve аге the signs of 


abd 
D- - i and A- b ce 
b c d e g 





The numbers D and A are called invariants since they are independent 
of the Cartesian coordinate system set up in a plane. 

Table 4.1 shows a classification of plane curves of the second order in 
terms of D and A. 
4.8 Surfaces of the Second Order 


Suppose that we are given a Cartesian coordinate system in three- 
dimensional space. A set of points whose Cartesian coordinates x, y and 
z satisfy the equation 


F(x,y,z2)-0 (*) 


is called a surface in three-dimensional space. The equation (ж) is called 
the equation of a surface. 


~A A y 


Fig. 4.36 
Example. The equation 
x-y-z2-agq-0 (а> 0) 


is an equation of a sphere with radius ғ and centre at the point (0, 0, 0) 
(Fig. 4.36). 
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Consider the quadratic polynomial of three variables x, y and z 
F(x, y, 2) = ax? + 2012ху + 2o13Xz + оооу? + 2orsyz 
+ 0332? + 2014Х + 2004У + 20342 + ода = 0, 
ati + of + ats + а + 023 + 033 > 0. 
The equation 
F(x, у, 2) 20 


is said to be the equation of surface of the second order. 

This equation determines a rich collection of surfaces. Its investigation 
is a much more complicated task than that of analyzing the equation of 
a plane curve and requires special techniques and procedures. This will be 
made in Chapter 6. Here we confine ourselves to classification of surfaces 
in general and to standard equations of major surfaces of the second order. 
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Surfaces of revolution. We consider a curve y (Fig. 4.37) defined 
in the xz-plane by the equation 


z-f() (x20) 


z 





si 








r 


Fig. 4.37 Fig. 4.38 


Revolving the curve y around the z-axis generates a surface called the 
surface of revolution (Fig. 4.38). 

We find the equation of this surface, ie, the equation to be satisfied 
only by points lying on the surface. 

Let Мобо, 0, zo) be an arbitrary point on the curve y (Fig. 4.39). When 
the curve гү revolves around the z-axis the point Mo moves in a plane which 
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passes through Mo and is perpendicular to the z-axis. The path of moving 
is a circle 


P+ Pax 


with radius equal to the abscissa Хо of Mo. 
Notice that for the given curve у 20 = f(xo). Therefore the coordinates 
of any point on the circle are related as 


о = (Мә? ty). 


It is easy to see that this reasoning is fully applicable to every point 
on the curve y. Whence we infer that the surface of revolution is defined 
by the equation 


z=f(V + yt). 





My(Xo,0, Zo) 


Fig. 4.39 Fig. 4.40 


Cylindrical surfaces. Suppose that we draw through each point of a 
given curve y a line / parallel to a given line 4. A totality of lines drawn 
in this way through points of the curve y, ie, a set of points lying on 
parallel lines drawn through points of the curve y is called the cylindrical 
surface. The curve гү is called the directing line of the cylindrical surface 
and any line which passes through a point on the directing line and is 
parallel to the line 4 is called the generating line of the cylindrical surface 
(Fig. 4.40). 

We find the equation of the cylindrical surface. 
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Consider a plane т which passes through an arbitrary point О and is 
perpendicular to the generating line of the cylindrical surface (Fig. 4.41). 

Let us set up a Cartesian coordinate system Oxyz with origin at O and 
the z-axis perpendicular to the plane т. Then the plane т becomes a coor- 
dinate plane, ie, the xy-plane. 

It is clear that the line of intersection of the cylindrical surface and 
the plane т is the directing line +o. 

Let F(x, y) = 0 be the equation of the directing line yo. 

We show that the above equation may be considered the equation of 
a cylindrical surface. 








Fig. 4.41 Fig. 4.42 


ча Indeed, if a point (x, у, 2) lies on the cylindrical surface, then the point 
(x, y, 0) belongs to the directing line yo (Fig. 4.42). Hence the point 
(x, y, 0) satisfies the equation 


F(x, y) = 0. 
On the other hand, this equation is also satisfied by the coordinates x and 
y of the point (x, y, z). Therefore we may regard the equation 

F(x, y) = 0 
as the equation of the cylindrical surface since it holds true for any point 
on this surface, » 


Example. Let there be given a Cartesian coordinate system Oxyz in 
three-dimensional space (Fig. 4.43). The equation 
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è y 
2 № 
defines а cylindrical surface called the elliptic cylinder. 
Remark. The equation 
ЁО, 2) = 0 


defines a cylindrical surface with directing line parallel to the x-axis, and 
the equation 


F(x, z) = 0 


defines a cylindrical surface with directing line parallel to the y-axis. 





Fig. 4.43 Fig. 4.44 


Conic surfaces. Suppose that we are given an arbitrary curve у and a 
point O outside y. Let us draw lines through O and every point of y. A 
totality of lines thus obtained, that is a set of points lying on these lines, 
is called the conic surface with the directing line y and vertex О (Fig. 4.44). 
Any line passing through the vertex O and a point on the directing line 
y is called the generating line of the conic surface. 

Consider a function F(x, y, z) of three variables x, y and z. 

The function F(x, y, z) is called a homogeneous function of degree 
q if for any t > 0 there holds 


F(tx, ty, tz) = t*F(x, у, 2). 
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Let us show that given a homogeneous function F(x, y, z) the equation 
F(x, y, 2) = 0 


defines а сопіс surface. 
- Indeed, let 


Fo, Jo, Zo) = 0, 


ie, a point Мо(хо, уо, zo) lies on the surface defined by the equation 
F(x, y, 2) = 0. 


Mo{Xo.Yos7o) 





Fig. 4.45 Fig. 4.46 
We set F(0, 0, 0) = 0 and draw a line / through the points Mo and 
ООО, 0, 0) (Fig. 4.45). The line / is given by the parametric equations 
x= іж, Y= уо, Z= і. 
Substituting the parametric equations into F(x, у, 2), we obtain 
F(x, у, 2) = Е@ж, tyo, tz) = t"F(Xo, Yo, %) = 0. 
This means that the line / belongs to the surface defined by the equation 
F(x, y, Z) = 0. 


Hence this equation defines a conic surface. 9» 
Example, The function 
у 2 


42 
Е(х, , ——3 4. 
(х, у, 2) 2 Б? 2 
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is a homogeneous function of degree 2, i.e., 
(x , (Фу) zy 





F(tx, ty, tz) 





a? b? с? 
x? y? z? 
2 2 
= (5 + "E =r) = F(x, y, 2). 


Whence we conclude that 
х2 у? 2 


a bc 


is an equation of a conic surface (Fig. 4.46). 


4.10 Standard Equations of Surfaces of the Second Order 


Ellipsoids. A surface of the second order given by the standard 
Cartesian equation 





Fig. 4.47 


To investigate the shape of the ellipsoid we revolve the ellipse 


X 12.11 
а? с? 


around the z-axis (Fig. 4.47). This leads to a surface 


ху! 4? 


called the ellipsoid of revolution, which gives an idea about the shape of 
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the ellipsoid. It suffices to compress the ellipsoid of revolution along the 


y-axis with the compression coefficient 2 < 1 to get the ellipsoid given 

a 

b 

y in (ж), we obtain the standard Cartesian equation of the ellipsoid*. 

Hyperboloids. Revolving a hyperbola given by the equation 

woe 
a с? 

around the z-axis generates a surface called the one-sheet hyperboloid of 

revolution (Fig. 4.48) which is defined by the equation 

2 


by the standard Cartesian equation. In other words, substituting -- y for 


= 1 


x+y z ер 
а? с? : 


2 
: “ЗӨН ЭР 
x 


Fig. 4.48 


By compressing a one-sheet hyperboloid of revolution uniformly with 
compression coefficient < 1 along the y-axis, we obtain a Ayperboloid 


of one sheet given by the standard Cartesian equation 


x 2 2 
zt 2. x 5, = 1. 
а b с 
The standard Cartesian equation of the hyperboloid of one sheet is easily 
obtained from the equation of the one-sheet hyperboloid of revolution by 
a 
b 





substituting — y for y into the latter equation. 


? The ellipsoid of revolution is also obtained by uniformly compressing a sphere 


xX + y? + 22 = а? with the compression coefficient = < 1 along the z-axis. 
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Revolving a conjugate hyperbola given by the equation 


Dee Pose? as! “2 
a cu 
around the z-axis generates a two-sheet hyperboloid of revolution 
(Fig. 4.49) 
Pty Ф m 
а? e 


As before, by compressing a two-sheet hyperboloid uniformly with com- 
pression coefficient? <1 along the y-axis, we obtain a hyperboloid of two 


Sheets given by the standard Cartesian equation 


х? MN NE 


T == 
l bB c 











Fig. 4.50 


Elliptic paraboloids. Revolving a parabola given by the equation 


2 
x^ = 2pz 
around the z-axis generates a paraboloid of revolution (Fig. 4.50). 


x + у? = 2pz. 


7—9505 
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By compressing a paraboloid of revolution uniformly with compression 
coefficient Е < 1 along the y-axis, we obtain an elliptic paraboloid given 


by the standard Cartesian equation 
2 2 
E + D = 24. 
p q 
It is easy to see that the standard equation of the elliptic paraboloid 


is obtained by substituting e y for y in the equation of the paraboloid 


of revolution 
х? + » 





22x. 


When p « 0, the standard Cartesian equation describes the elliptic 
paraboloid shown in Fig. 4.51. 





Fig. 4.51 Fig. 4.52 


Hyperbolic paraboloid. A surface of the second order given by a stan- 
dard Cartesian equation of the form 
2 2 
x y 
TC =, (9) 
р q 
where p > 0 and q > 0, is called a hyperbolic paraboloid. 

We shall investigate the shape of this surface by applying the following 
technique. Draw planes parallel to coordinate planes. These planes, called 
Sections, intersect the surface in question along plane curves. Mapping the 
lines of intersection onto the coordinate planes, we obtain families of lines 
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whose structures, i.e., shapes and mutual locations of the lines on the coor- 
dinate planes, enable us to make a conclusion on the shape of the surface 
itself. 
Let us start with sections z = h=const parallel to the xy-plane. Depend- 
ing on the values of Л, we obtain three families of intersection lines, namely 
(a) a family of hyperbolas 








х? у? 
ЭН =1, 
(2phy (аһ)? 
where й > 0; 
(b) a family of conjugate hyperbolas 
(V-2ph | (V-2qhy 
where h « 0; 
(c) two intersecting straight lines 
2 2 
x y 0, 


W wa 
provided that й = 0. 


Fig. 4.53 


Notice that these straight lines are asymptotes of all the hyperbolas of 
the families (a) and (b), i.e., they are asymptotes of hyperbolas for any value 
of h distinct from zero. 

Mapping the intersection lines onto the xy-plane, we obtain the family 
of lines shown in Fig. 4.52 from which we infer that the surface in question 
is of a saddle shape (Fig. 4.53). 


‚1 
і 
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Now we cut the surface by planes у = A. Substituting й for y in (ж), 
we obtain a family of parabolas in the xz-plane 


2 р? 
x” = 2р{ 4 + —— |, 
á ( 3р) 
as shown in Fig. 4.54. 
Similarly, cutting the surface by planes x = Л, we obtain a family of 


parabolas in the yz-plane 


аж) 
2ра }' 


as shown іп Fig. 4.55. 





Fig. 4.54 





Fig. 4.56 


From Figs. 4.54 and 4.55 we conclude that a hyperbolic paraboloid 
(Fig. 4.56) is obtained by translating a parabola x? — 2pz along a parabola 
у? = —2qz or by translating у? = —2qz along x? = 2pz. 

Remark. Intersecting a surface by a plane parallel to coordinate planes 
is fully applicable to the analysis of all surfaces considered above. However, 
revolving plane curves of the second order and further compressing surfaces 
thus obtained is a much easier way to investigate surfaces of the second 
order. 
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Cylinders. Recall that the shape of a cylinder (of cylindrical surface) 
is determined by the shape of its directing line. Here we enumerate the fol- 
lowing kinds of cylinders we have encountered in the preceding sections. 
These are 








Fig. 4.57 Fig. 4.58 


(a) elliptic cylinder (Fig. 4.57) 


2 у? 
bcd, 
а? b? 

(b) hyperbolic cylinder (Fig. 4.58) 
xot. 
wo go 


(c) parabolic cylinder (Fig. 4.59) 
y! = 2px. 





Fig. 4.59 Fig. 4.60 
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Cones of the second order. A surface given by the standard Cartesian 


equation 
x у z? 
AES. 
a b с 


is called а сопе of the second order. 
We may investigate the shape of this surface either by revolving two 
intersecting straight lines 


х? 2 
a? c 


=0 


around the z-axis and further compressing the surface thus obtained or 
by intersecting the cone in question by planes parallel to coordinate planes. 
In both cases we infer that a cone of the second order is of the shape shown 
in Fig. 4.60. 


Exercises 
2 2 


x y 
1. For the hyperbola ^9 ^6 
foci, (b) the eccentricity, (c) the equations of asymptotes and directrices, 
(d) the equation of the conjugate hyperbola and its eccentricity. 
2. Write down the equation of a parabola provided that the distance from 


the focus to the vertex is equal to 3. 


— ] find; (a) the coordinates of 


2 2 
3. Write down the equation of the tangent to the ellipse ze + eae 


32 18 
at the point M(4, 3). 
4. Identify the types and locations of plane curves given by the equations: 
(a) х2 + 2у + 4х – 4у = 0; (b) бху + 8y? — 12x - 26y + 11 = 0; 
(с) x? – 4ху + 4y? + 4x – Зу - 7 = 0; (d) ху+х+у= 0, (е) хХ- 
5ху + 4y? + x + 2y — 2 = 0; (f) 4x? — 12ху + 9? – 2x + 3y – 2 = 0. 


= 1 


Answers 
E А 275 БУ нийн, СЭ, ин" 
1. (а) Fi(— 5, 0), Б 65, 0); (b) e = 3 , (Фу = +35 = $55 (4 5- dé ^ 1, 
5 2 pe La ЗҮ? : 
e- үй 2. y^ = 12x. 3. 3х + 4y — 24 = 0. 4. (a) The ellipse S + aS 1 with centre 
х? ү 
at O'(—2, 1) and the major axis О’ X parallel to the x-axis; (b) the hyperbola a Gra ls 1 


with centre at O'(—1, 2) and the tangent of the angle between the axis O' X and the x-axis 


equal to 3; (c) the parabola Y? — = X with vertex at O' (3, 2), the vector of the О’ X-axis 


directed to the vertex is ( — 2, —1); (d) the hyperbola with centre at O’(=1, 1), the asymptotes 
are parallel to the x- and y-axes; (e) a pair of intersecting straight lines x — y — 1 = 0 and 
x — 4y + 2 = 0; (f) a pair of parallel straight lines 2x — 3y + 1 = Oand 2x — 3y – 2 = 0. 


Chapter 5 


Matrices. Determinants. 
Systems of Linear Equations 


5.1 Matrices 


Definitions. An m x n matrix is an array of m: n numbers a; (i = 1, 


2, ...,m; 1, 2, ..., n) arranged in the rectangular form 
O1 012 Qin 
BN oc ec Pe (5.1) 
Qm1 Om2 Omn 


The numbers oj; (i = 1, 2, ..., m; j = 1, 2, ..., п) are called elements 
or entries or coefficients of A. 
The horizontal n-tuple of numbers 


ал, «р, ..., Qin (= 1, 2, ..., m) 


is called the ith row of the matrix A. 
The vertical m-tuple of numbers 


Qij 
a2; 
G=1, 2, ..., п) 


is called the jth column of the matrix A. 

Therefore each m x n matrix has m rows and n columns. The element 
ош occupies the position where the ith row and jth column meet. 

The numbers і and j indicate the position of the element aj in A and 
may be thought of as the coordinates of oj; in A (Fig. 5.1). A concise nota- 
tion for matrices of the form m x n is 


A = (aij). 


A 1 x n matrix is called a row-vector and an т х 1 matrix is called 
a column-vector. 


104 5. Matrices. Determinants. Systems of Linear Equations 


When m = n, the matrix 


ai 912 Qin 

21 022 a2, 
A= : 

Qn} Сїл2 ... Onn 





Fig. 5.1 


For example, the matrix A = (олу) containing a single element is a 
square matrix of order 1. 
The n-tuple of numbers 


Q11, (22, ..., Qnn 


is called the principal diagonal of the matrix A. 
An m x n matrix whose elements are zeros is called a zero matrix. 
A square matrix of the form 


1 0 0 
pep. 23:22 62) 
0 0 1 


is called an identity or unit matrix. 

For any m x n matrix there exists a zero matrix and for any square 
matrix of a given order n there exists a unit matrix. 

We shall denote the set of all matrices of the type m x n by R"*" with 
the understanding that we are concerned with matrices whose elements are 
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real numbers. Since a set of real numbers is conventionally denoted by R, 
the notation IR" х" signifies that we consider a set of all m x n matrices 
of real numbers. In Chapter 26 we shall consider a set of m x n matrices 
of complex numbers denoted by С” х" because a set of complex numbers 
is conventionally denoted by C. 

To signify that matrix (5.1) is of the type m x n we shall write 


А = (ai) € R™™”. 


The matrices A = (ay) and B = (8y) are called equal if they are of the 
same type and their elements occupying identical positions coincide, i.e., if 


AER™*", Вер" х" 
апа 
ау = Ву ( = 1, 2, ..., т; ј = 1, 2, ..., п). 
In symbols, we write A = B. 
Now we shall turn our attention to arithmetic operations on matrices. 
Addition of matrices. Let A and B be two matrices of type m x n, that 
is, 
А = (ay) € R"*" and В = (Bi) € R&"*", 
The sum of matrices A and B is the matrix С = (yj) € R"*" whose 
elements are 


үй = ay + бу (i = 1, 2, .. M; J=1,2,..., n). (5.3) 


In symbols, we write C = A + B. 
Multiplication of a matrix by a scalar. The product of a matrix A = 
(ai) € Р" х" by a scalar À is the matrix В = (8y) € Р" х" whose elements are 
By = day ( = 1, 2, ...,m; ј = 1, 2, ..., n). (5.4) 


In symbols, we write В = dA. 
By way of illustration we show how these operations are performed by 
using notation (5.1): 


O11 012 ... Ойл Bu Ba . bın 
021 09092 Q2n n B 82 Bon 
Om1 Om2 Omn Brat Bm2 Bma 
oui + Bi 024082 ... aint Bin 
azı + B21 an + Вә ... On + Ban 


Om] + Вт Om2 + Bm2 24. Omn + Втп 
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O11 012 a seis Oln Хол Хог. E Хол 
A O21 022 ... Q2n ЕЕ har Àa22 -— Хон 
Om| От? Omn Aami Мат2 AOimn 


Multiplication of matrices. Let A = (ок) and B = (6x;) be two square 
matrices of order n. The product of A and B is the matrix C = (үд) € Р" х" 
whose elements are 


үу = aufi + оро) +... + Qinbnj (5.5) 
à, 1,2, eun) 


In symbols, we write C = AB. 
By way of illustration we write this operation as 


aii 12 ... Qj ... Qin 
21 922 ... a2 eee Q2n 
I Qtil Qi2 Qij Ойл 
On “л? ал} Onn 
J 
Bu Br fj Bin 
ёл 82 Вэ) Bon 
И I E rac a 
8йл Вә Bi Bin 
Bu 8.2 В»; Ban 
J 
Үп Ү12 wu Yin 
¥21 22 Ү2; Y2n 
Е Yi Yi2 Yij Yin 1 
Үп1 Үп... үп -.. Ynn 


Notice that the sum of n eleivents (numbers) 
Qi t O2... tart... + On 


is conventionally written in the form 


Mes 


Qj. 


i=1 
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Then the operation of multiplication of two matrices may be conveniently 
presented as 


n 


yj = У: Gik(k; (i, J = 1, 2, ..., n). 


In general AB z BA. 
Examples. (1) Let 


_ [01 _ [00 
a= (oo) and в-(10) 


Then 


_ [10 _ [00 
aB = (50) and ва = (00). 


И is not hard to find similar examples for matrices of any order. 
(2) Let A be a square matrix of order 3, i.e., 


ai qc аз 
А = { о o az 
оз 032 азз 


ча We show that pre-multiplying the matrix А by the matrix 


1 0 0 
Р,-10 0 1 
0 1 0 


interchanges the second and third rows in A. 
We have 


1 0 0 оп an2 аз 
РА = 0 0 1 Ч O21 022 023 
0 1 0 031 032 033 


l:oii* 0:021 Ооз 1:012 + 0-022 0-032. 1:013--0-а023-4 0-033 
= | 0:011+ 0:021 + 1-031. 0-ол; + 0:022 + 1:032. 0-013 + 0-053 + 1:033 
0-ол1+ 1:921 0:031. O'an 1:022 + Otas 0-олз + 1-a23 + 0:033/ 


Similarly, it is easy to verify that post-multiplying the matrix А by P23 
interchanges the second and third columns in A. 


108 5. Matrices. Determinants. Systems of Linear Equations 


(3) For any matrix A there holds 
ГА = A-I ='A, (5.6) 


where I is an identity matrix. 
~ Let A be a square matrix of order 3 


ai 12 013 
А = 021 022 023 


Then 
ап оз 013 1 0 0 
А І = 021 022 023 0 I 0 
Q31 Q32 Q33 0 0 1 


œil + 02:04 аз:0 ayy-0 + anl a13:0 о11:0 + o15:0 + аз:1 
=| oar] + a22°0 + 023-0. 0541-0 + 22:1 + 02370 021-0 + 22-0 + 0023:1 
œz l + 32-0 + озз:0 a3,:0+ 032-1 + 033:0 a31:0 + 035:0 + 033:1 


031 032 Q33 
It is easy to verify in the same way that 
ГА = А. » 


Relation (5.6) explains why the matrix I is called the identity matrix. 
The operation of matrix multiplication obeys associative and distribu- 
tive laws, namely if A, B, C and D are square matrices of order n, then 


(AB)C = А(ВС), 
AB + C) = AB + AC, 
(В + C)D = BD + CD. 


ча By way of illustration we show that A(B + C) = AB + AC. It is clear 
that the three matrices AB, AC and A(B + C) are of the same order л. 
Their elements in the (7, /)th position are, respectively, 


n 


D aikBkj, 


n 
У! QikYkjs (i = 1, 2, ..., п). 
k=1 


n 


D ou(Bkj + үк), 
k=1 
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Then we obtain 


n n л 
У) а*(Ву + үк) = ХУ) окбк + 2 QikYkj 
=1 = = 
and this proves the property in question. 
Similar reasoning is applicable to the other two identities. > 
Remark. The operation of matrix multiplication may also be defined 
for rectangular matrices. 
“4 Let there be given matrices А = (ак) € IR"*" and B = (Bj) € Р" х". 
Then the elements of the matrix C = AB € R"*' are 


n 


w= У, окбы (-1,2,...т:1-1,2, ..., D. (5.7) 
1 


Therefore the product of two rectangular matrices may be defined only 
when the number of columns in the first factor is equal to the number 
of rows in the second factor (Fig. 5.2) as is easily seen from (5.7). > 


e = 4x5 
Е 


Fig. 5.2 


Products of rectangular matrices obey the laws (5.6) provided that the 
corresponding operations of matrix multiplication make sense. 
Example. Compute the product of the matrix 


9 5 1 0 ] 2 
А-1 1 9 by the matrix В = : 
8 6 1 9 4 2 


ча Observe that the number of columns in A is equal to the number of 
rows in B. This means that we may multiply A by B. Computing the 
product, we obtain 


9 5 1 0 1 2 
АВ-(1 9]. 
8 6 1 9 4 2 
(123 9.0+ 5:9 9.1454 2292) 


141-941 L0-9.9 11+9:4 1:2 + 9:2 
8-1 +61 8.0-6.9 8.11464 8.2 + 6:2 


14 45 29 28 
={ 10 81 37 20 |]. > 
14 54 32 28 
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Summing up elements of a matrix. In applications we often need to 
compute the sum of elements of a rectangular matrix, say the matrix A 


11 012 Ойл 

O21 022 O2n 
A mu cece ten tee es caste tel Ue үн 

Qm1 Om2 Omn 


We may compute the sum H of elements of A either by computing 
column sums 


m m m 
2 Qil, = Qi2, ..., > Qin 


i=] і= 1 i=l 


and adding them together so that 


H = Хайж DICEN. OT Y (26) 


i=l i=l i=l j=1 


or by finding row sums 
n n n 
У о, 2j бор ... 2j Amy 
j=l j=l j=l 
and adding them together so that 


n n n m n 
Н = Хоу + У 02) +... + È аъ = È ( È a). 
j=1 j 


j=1 j=1 i 


M 
- 


Whence we obtain 


n m m n 
5 àoej- à) Хов 
j=1 i=l i=1 j=l 
Transposition of matrices. The matrix 
Ou 001 00 Omi 
a12 Q22 От2 5, 
Qin O2n Отп 


[^ 4 61 012 Gin 
@21 022 ... Q2n 

A= can В (5.1) 
Om1 Om2 Qmn 


In symbols, we write A’ to denote the transpose of A. 
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Example. By definition the transpose of 
111 2314 
Ac C 6 7 s) 


3 1 5 
2 6 
3 7 


А’ = 
4 8 


It is important to observe that the element of A’ in the (i, /)th position 
coincides with the element of A in the (j, д: position. The operation of 
transposition interchanges rows and columns of the matrix A so that rows 
in A become columns in A’ and columns іп A become rows іп A’, There- 
fore if the matrix A has m rows and n columns, the transpose A’ of A 
contains n rows and m columns (Fig. 5.3). 





Fig. 5.3 


The operation of transposition satisfies the following conditions: 
(а) (A')' =A, 
(D (A + B)’ = A’ +B’, 


(с) А)’ = A’, 
(d) (AB)’ = B'A’. 
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Linear dependence of row-vectors. We consider the operations of matrix 
addition and multiplication of a matrix by a scalar over a set of 1 x n 
matrices, i.e., row-vectors. 

Let 

а = (o1, a2, ..., On) € RT", 
b- (81, Вэ, б Bn) € Ren 
Using (5.3) and (5.4), we obtain 
a+ b= (ai + 3, a2 + Вэ, ...Й. Qn + Bn) (5.8) 


and 
Ха = (Хол, Ао», ..., Ма). (5.9) 


It is easy to verify that operations (5.8) and (5.9) satisfy the following 
conditions: 


(a)a+b=b +a, 

(b) (a + b)+ с=а + (Ь + с), 
(с)<а+0 = 0 +а = а, 

(d) Ма + b) = da + №, 


(е) A + ија = da + pa, (5.10) 
(f) Хра) = Qua, 
(в) за = а, 


the equation а + х = 0 being uniquely soluble for any row-vector a. In 
(5.10) А and y are arbitrary scalars, a, b, c and x are row-vectors (1 x n 
matrices), 0 is a zero row-vector (a zero 1 x n matrix). We shall see in 
Chap. 6 that conditions (5.10) define a linear vector space over a set of 
row-vectors. 

Now we shall introduce an important notion of linear dependence of 
TOW-Vectors. 

Let ат, 42, ..., am be m row-vectors. A relation of the form 


= Mai + №а +... + Хийл (5.11) 
is called a linear combination of row-vectors аг, 32, ..., am With scalars 
№, №, ..., №. 

The linear combination (5.11) is called nontrivial if at least one of X;, 
№, ..., № is distinct from zero and trivial if № = № = ... = № = Q. It 


is easy to see that in the latter case b is a zero row-vector. 

Row-vectors are called /inearly dependent if there exists their nontrivial 
linear combination identical to a zero row-vector, and linearly independent 
if a zero row-vector corresponds to their trivial linear combination only. 
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We show that for linearly dependent row-vectors one of them may be 
expressed as a linear combination of the others. 
* Let a), 32, ..., ат be linearly dependent row-vectors. This means that 
there exist scalars M, №, ..., №, not all zero, such that 


Ма + À232 +... + Àm- 18m- 1 + Амал = 0. 


Without a loss of generality we may set №, ~ 0. Transposing the first m — 1 
summands to the right, we obtain 


Хийт = — ai — №аз – ... — №-;аһ-1. 


Then dividing the above expression by М = 0, we arrive at 


ED OO LR 
ал= — S аре oon MDC Am 


Whence we conclude that am is a linear combination of the other row- 
vectors. 

The converse is also true, namely, if one of the row-vectors is a linear 
combination of the other (m — 1) row-vectors, i.e., if 


ап = ша + [232 + ... + рп-1ат- 1, 
then the nontrivial linear combination 
pias + posa +... + Bm-l18m- i1 + (-Dam 


of the row-vectors ач, 32, ..., Am—1, 4m iS equal to a zero row-vector. Hence 
the row-vectors are linearly dependent. > 
Similar property of linear dependence is easily derived for the set R” х! 
of column-vectors, that is, for the set of all m x 1 matrices. 
Elementary operations on matrices. Let A and A be two arbitrary 
m X n matrices and let 


а, 22, ..., ак, ..., а, ..., Am 


be rows in А. _ 
The matrix A is said to be obtained from the matrix A by 


(a) interchanging two rows in A if a1, 32, ..., а, ..., ак, ..., ат are 
rows of A, 

(b) multiplying one particular row of A by a nonzero scalar @ if ai, 
аз, ..., Вак, ..., а, ..., am are rows of A, 


(c) adding one row of A multiplied by a scalar y to another row if ai, 
аз, ..., ак, ..., A + Yak, ..., ат are rows of A. 

The operations (a)-(c) are called the elementary row operations. 

Elementary column operations can be defined similarly. 


8—9505 
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Example. The matrix 


1 0 9 
0 6 1 
0 1 0 
is obtained from the matrix 
1 0 9 
А-10 1] 0 
0 6 1 
by interchanging the second and third rows, and the matrix 
0 1 9 
1 0 0 
6 0 1 


is obtained from A by interchanging the first and second columns. 
Adding the third row of A multiplied by —2 to the first row of A, 
we obtain the matrix 


1 -12 7 
0 1 0 
0 6 1 


Remark. It is easy to see that when the matrix A is obtained by applying 
either of the elementary row operations to the matrix A, the transition from 
À to A is achieved by the same row operation, that is, either by interchang- 
ing the Ath and /th rows or by multiplying the kth row by the scalar 1/8 
or by adding the kth row multiplied by — у to the /th row. 

Now we shall turn our attention to the procedure of changing from 
an arbitrary matrix to a matrix of simpler form by a finite sequence of 
elementary row operations. 

Let A = (aij) € R"*" be a nonzero matrix. 

Step 1. Since A is a nonzero matrix there exists at least one nonzero 
element in A. Consequently, there exists at least one nonzero row in A. 
We choose a nonzero row such that its first nonzero element occurs in a 
column with the smallest number kı х Interchanging this row and the 


first row of A, we reduce A to the matrix 
0... 0 a ... ад 
0 ... 0 O2k| ... O2n , (5.12) 
0 0 Omk, Omn 


where «@ з 0. 
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Adding scalar multiples of the first row of (5.12) by — 2 (i = 2, 
Qi 1 





3, ..., m) to the corresponding rows, we obtain a matrix of the form 
0... 0 eo) e wm 
0. 2c). 0O uno ah (5.13) 
0... 0 0... a 


Observe that the elements in the kıth row are zeros with the exception of 
the element af}, . 

It may happen that rows of (5.13) are zero ones with the exception of 
the first row. In this case the procedure terminates. If this is not the case, 
ie, if there exist nonzero rows in addition to the first row, the procedure 
Boes on. 

Step 2. Similar to Step 1 we choose a row such that its first nonzero 
element occurs in a column with the smallest number Ёс (kı < k2). 

Then interchanging this row and the second row of (5.13), we obtain 


о... 0 af, 225 afk i af, 14: a 

о... о 0 223 0 af, ... af Y, 

о0о... 0. 0... 0 T MEETS 
(5.14) 


where a$, x 0. 
(D 


Adding scalar multiples of the second row by — RS (1 = 3, 4, ..., 
а 2 





m) to the corresponding rows, we reduce (5.14) to a matrix such that its 
first row is identical to that of the first row in (5.13) and its elements in 
the Koth row are zeros with the exception of two elements from the top. 

The procedure terminates if the matrix thus obtained contains no non- 
zero rows but the first and second rows, and goes on otherwise. It is impor- 
tant to observe that for the procedure discussed the total number of steps 
to reduce a given matrix to a simpler form does not exceed the number 
min (m, n). This means that the procedure terminates in a finite number 
of successive steps and we arrive at a matrix of the form 





87 


116 5. Matrices. Determinants. Systems of Linear Equations 


where ki < ka <... < k, and 


a, z 0, aR, # 0, gue; aQ #0. 


The matrix (5.15) is called a matrix of the schematic form. 

Notice that the procedure of reducing a matrix to the schematic form 
involves a sequence of elementary row operations (a) and (c). In other 
words, we may state the following theorem. 

Theorem 5.1. The transition from any arbitrary matrix to a matrix of 
schematic form is achieved by a sequence of elementary row operations. 

Examples. (1) Reduce the matrix 


00 00 01 
[o0 -2 3 -45 
^-loo oo оо 
01 11 11 


to a matrix of schematic form. 
я ids the first and fourth rows of A, we obtain 


1 1 
23-43 
0 0 
0 1 


Interchanging the third and fourth rows of Ау, we obtain 


01 11 11 
[90-23-45 
=“\oo0o 0 0- о 1 

00 00 00 


The matrix А> is of schematic form. > 
(2) Reduce the matrix 


3-1 32 5 
ds5-3 23 4 
“(10 -3 -5 0 -7 

7 -5 14 dg 


to a matrix of schematic form. | 
я эл the first and third rows, we obtain 


5.1 Matrices 117 


Step I. Subtracting the first row of A; multiplied by numbers 5, 3 and 
7 from the second, third and fourth rows, respectively, we obtain 


1 -3 5 0 7 
a 0 12 27 3 39 
?7 10 8 18 2 26 


0 16 36 4 50 


Step 2. To simplify computations we apply the elementary operation 
(b), namely, we multiply the second row by 1/3, the third row by 1/2 and 
the fourth row by 1/2. Then from А» we obtain 


1 -3 5.0 -7 
0 4 91 B 
0 4 9 1 B 
0 8 18 2 25 


Subtracting the second row of Аз multiplied by 1 and 2 from the third 
and fourth rows, respectively, we arrive at 


1 -3 5 0 -7 
0] 49 1 8 
A-lo 00010 
0 090011 


Step 3. Observe that the third row of A4 is a zero one. Then interchang- 
ing the third and fourth rows, we obtain 


1 -3 5 0 7 
0 49 ] 13 
0 0 0 0 [-1 
0 00 0 0 


As = 


The matrix As is of schematic form. » 
By a sequence of elementary column operations a matrix of the form 
(5.15) is reduced to the form 


Ё 
1 0 0 
0 1 0 0 
F » 66 о э а а э э е c o. э ө «а э э э э 
00... 1 ... 0 ы 
ee nu e : р (5.16) 


0 0 
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whose elements are zeros with the exception of those occupying the posi- 
tions (1, 1), (2, 2), ..., (r, r), each of which is equal to unity. 
Interchanging the columns in (5.15) so that the kıth column replaces 
the first one, the Ко column replaces the second one, etc., until the k,th 
column replaces the rth column, we obtain a matrix of the form 


jj 8: ... бү ... Qin 


: (5.17) 





where ёп = 0, à22 # 0, .... Orr ж 0. 
For example, interchanging the third and fifth columns in As, we obtain 


1 —3 7 0 5 


Q4 1 19 
Ae-lo Dai 0 0 
0 о 0 00 


Adding multiples of the first column of (5.17) by — 29. (j = 2,3, ..., 
aii 


n) to the corresponding columns, we obtain a matrix of the form 


ài 0 ... 0 ... 0 ... 0 
0 822 2; Gar Ол 
0 0 0 Orr бул 
0 0 0 0 0 


where the first row contains only one nonzero element, ài;. 
Similarly, operating on the rows 2, 3, ..., r, we obtain the matrix 


& 
1 вз 0 3 
As i (5.18) 
0 ET 


By a sequence of elementary column operations of type (b) matrix (5.18) 
is reduced to the form (5.16). 
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It is easy to observe that the matrix Ас admits a representation of the 
form 


10 000 

04 000 
А= {оо 100p 

оо ооо 

whence we obtain 

10000 

01000 
44-10 01 00 

00000 


Elementary matrices. The elementary operations which are summarized 
above are closely associated with square matrices called elementary ma- 
trices. These are of the following types: 

(a) matrices obtained from the corresponding identity matrices by inter- 
changing any two rows. For instance, the matrix 


J 
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by interchanging the ith and jth rows. Observe that off-diagonal elements 
of Ру; are zeros with the exceptions of the elements in the (i, /)th and 
Q, Dth positions. 

(b) Matrices obtained from the corresponding identity matrices by sub- 
stituting a nonzero scalar for a diagonal element. For example, the matrix 


differs from the identity matrix by the element В # O in the (/, /)th position. 
Notice that all off-diagonal elements of D; are zeros. 

(c) Matrices that differ from the corresponding identity matrices by one 
off-diagonal element. For example, the matrix 


i 
1 


Li = EAD ares xu J 


differs from the unit matrix by the element « in the (/, i)th position, and 
the matrix 


Rj = ьа) ee ee i 


2 N 
also differs from the identity matrix by the same element but located in 
the (i, /)th position. Observe that off-diagonal elements of Lj; and Ri; are 
zeros with the exception of the element y. 
We point out here that for any matrix each elementary operation is 
equivalent to pre- or post-multiplication of the matrix by a suitable elemen- 
tary matrix. 
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Theorem 5.2. Elementary operations on a matrix are equivalent to pre- 
and post-multiplication of the matrix by elementary matrices. 

Let А be an arbitrary matrix and Р;;, D;, Li; and Ri; be elementary 
matrices given above. Then 

(a) interchanging the ith and jth rows of A is equivalent to pre- 
multiplying A by Pj, 

(b) multiplying one particular row of A by a nonzero scalar @ is equiva- 
lent to pre-multiplication of A by Dj, 

(c) adding one row of A multiplied by a scalar y to another row is 
equivalent to pre-multiplying A by Їл, 

(a^) interchanging the ith and jth columns of A is equivalent to post- 
multiplication of A by Р», 

(b^) multiplying one particular column of A by a nonzero scalar 6 is 
equivalent to post-multiplication of A by Dj, 

(c’) adding one column of A multiplied by a scalar + to another column 
is equivalent to post-multiplication of A by Ri;. 
ча For simplicity we consider a square matrix of order 3 


Recall that pre-multiplying A by the matrix 


1 0 0 
P3={0 0 1 
0 1 0 


gives 
Oii 012 13 
B= 031 032 033 
21 22 923 


and post-multiplying А by P23 gives 
11 Q13 Q12 


С= са 023 022 
Q3; 033 032 


It is easy to see that the matrix B differs from A by the order of rows 
and the matrix C differs from A by the order of columns. 
Similarly we can verify the conditions (a) and (a’) for the matrices 


0 1 0 0 0 1 
Pi = 1 0 0 and Pi3 = 0 1 0 
0 0 1 1 0 0 
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Let us multiply A by the matrix 


1 0 0 
D-[09 8 0 |. 
0 0 I 


Pre-multiplying A by D», we have 


1 0 0 11 012 a3 
DA={0 g8 0 021 022 003 
0 0 I O31 032 033 


O11 012 13 
= | Born Barr fo 
зу O32 33 


Post-multiplying А by D2, we obtain 


Q11 012 Q13 1 0 
AD: = | an an оз 0 ) 
аз 032 азз 0 
ci fen an 
an Bar az |. 
az fo» азз 


Similarly, we verify the conditions (b) and (b’) for the elementary ma- 


trices 
в o 0 1 0 0 
D={0 1 0 and р:--10 | 0 


0 0 1 0 0 B 


Notice that the conditions (c) and (c’) can easily be verified in a similar 
way. > 2 


о о 
—- о 


5.2  Determinants \ 


We associate with a square matrix а single number called a deter- 
minant according to the following rules: 
The determinant of the matrix of order 1 


(ол) 


is equal to оці. 
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The determinant of the matrix of order 2 


Qi1 «2 
O21 22 


is a number equal to 011022 — 012021. 
In symbols, we write 


011 912 
det = 
21 022 


The determinant of the matrix of order 3 


“її 912 
021 922 


= 011002 — 012021. (5.19) 








ai Q12 O13 
O21 Q22 23 
031 032 Q33 


is a number equal to 




















022 Q23 912 13 Q12 C013 
ai + озі . 
a32 033 032 ©зз 022 Q23 
Using (5.19), we obtain 
“11 @12 13 O11 012 O13 
det | œn an az | = | ор az az 
Q31 032 033 031 032 033 
= 401022033 + 021032013 + 031012023 (5.20) 
— 051032023 — 021012033 - 031022013. ! 
8, o o 
З, faye? NA 2 
эрэ a SUA 
att, +h, YN 
ot F tq + to их х X 
Зо + Хүн" о \ “Ур 
+ JEU [sàs 
i ru NM 
ы du ч За, AS х 
"з 3, PM a 3 


Fig. 5.4 


Figure 5.4 is helpful to memorize formula (5.20). The drawing on the 
left refers to the arrangement of the positive terms in (5.20) and that on 
the right to the arrangement of the negative terms. 

Suppose that we have defined determinants for matrices of orders less 


than л. 
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The determinant of the matrix of order n 


11 012 Qin 
Du RPM i (5.21) 
Он Ол2 Onn 
is a number equal to 
D = ouMii - Мн... + (– 1)" о: Ма, (5.22) 
where Mj (i = 1, 2, ..., n) is the determinant of order n — 1 
12 O13 ... Qin 
O21 Q23 ohare O2n 
Qj-1,22 Qi-13 +--+ Qi-l,n (5.23) 
1-1, 041, --.  GCirl,n 
Оп2 Олз Qnn 
obtained by deleting from A the ith row and the first column. 
In symbols, we write 
O11 O12 Ойл 
Р-ааА-1АГ- 2528 21222221 (5.24) 
Qni | Ол2 Onn 


Formula (5.22) defines the expansion of determinant with respect to 
the first column. 

It is easy to see that for n = 2 and n = 3 formula (5.22) gives results 
identical to those given by (5.19) and (5.20), respectively. 

For example, for n = 3 we have 


D = o1(022033 — 0532023) — o21(012a33 — 032013) 
+ a31(012023 — 022013). (5.25) 
We abbreviate (5.22) by writing 
: - 
D = У) (-D'*loiMa. (5.26) 
i=l 


Example. Compute the determinant of the triangular matrix 


Оі] 012 013 ... Ойл 
0 бс 023 ... «л 
А = 0 0 033 ... On 


52 Determinants 125 


44 We have 
022 023 Q2n Е a 
33 3n 
0 0t33 sus. jn 
lAI EO qu. o m узд vid eg ca г = 011022 | ............ 
0 0 Onn 0 Onn 
=... = 031022... Ann. 


Whence we infer that the determinant of the triangular matrix is equal 
to the product of the elements on the principal diagonal. > 

The determinant of a matrix of order п — 1 obtained from the matrix 
A by deleting the ith row and the jth column is called the minor Mi; of 
A (Fig. 5.5). For instance, the minor М is the determinant of the matrix 
obtained from A by deleting the ith row and the first column. 





Minor М; Obtained 
from matrix A by 
deleting the ith row 


and the jth column 


Fig. 5.5 
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Consider n numbers 
n 
D; = 2i (-)'"ajyMyj (= 1, 2, ..., n). (5.27) 
It is easy to see that for j = 1 formula (5.27) coincides with (5.26). Let 
us prove that D = Dj = Р, = ... = Dy. 
~ We confine ourselves to the case when n = 3. By setting j = 2 in (5.27), 
we have 














Dı = -anM + 022M22 — аз Мэ, (5.28) 
where 
az 023 
Мр = = 0210033 — 031023, 
Q31 33 
Qj1 Q13 
Mz = = ал033 — 031013, (5.29) 
аз: a3 
оп] олз 
My = = 011003 — 013001. 
озу Ооз 








Whence we obtain 
Dy = – anlaa — 0531023) + a22(a11033 — 013031) 
— anlaa — 013021). (5.30) 


Comparing (5.25) and (5.30), we conclude that D = D2. The identity 
D = D; is verified in a similar way. > 

Remark. In general the identities D = Р; = D; = ... = Dg are verified 
by computing determinants of orders n — 1, n — 2, etc. 

Therefore we have proved that the expansion of determinant with 
respect to the jth column is given by 


D- 5 (=) Му G=1, 2, ..., n). (5.31) 


i=l 


We call the number: 
Ay = (-1)'*4My : (5.32) 


the cofactor of the element oj; in 1А!. 
Notice that the cofactor Ai of oj; is independent of the value of aj. 
In other words, Aj; remains unchanged when the element aj; is replaced 
by Bij in A. 
Substituting (5.32) into (5.31), we may write 
n 
р = M aijjÁi;j = ayj + oojA2j +... + OnjAnj 


i=l 


G=1,2,... п). (5.33) 
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Whence we conclude that the determinant of a square matrix is equal to 
the sum of the elements of any column multiplied by their respective 
cofactors. 

Similarly to (5.27) we may introduce the numbers 


n 
А = У) (-1) о; Му (= 1, 2, ..., n) (5.34) 
j=l 
such that A; = A2 = ... = A,. To verify these identities it suffices to inter- 


change the rows and columns іп A, i.e, to transpose A, and apply to A’ 
the same reasoning as above. In conclusion we shall prove the following 
theorem. 

Theorem 5.3. For any square matrix of order n there holds 


D= У (-U'*ajgMj (і = 1, 2, ..., n) (5.35) 
j=l 


ie., the determinant of any square matrix of order n admits expansion with 
respect to the ith row. 
-4 It is sufficient to prove that 


Ai = D. (5.36) 


Again we confine ourselves to n = 3. 
Using (5.34), we have 


A; = апМи — anM + оз Муз = от (022033 — 023032) 


— о12(021933 — 023031) + ол3(021932 — а22031). 


Comparing this expression with (5.25), we infer that A; = D. > 

Remark. In general the identity A, = D is verified by computing deter- 
minants of orders n — I, n — 2, etc. 

Substituting (5.32) into (5.35), we arrive at 


n 
D = У) ayAy = апдАп+ арАр +... + ainAin 


j=l 
(= 1,2, coy. (5.37) 


Therefore, the determinant of a square matrix of order n is equal to 
the sum of the elements of any row multiplied by their respective cofactors. 

Example. Compute the determinants of the elementary matrices Pj, 
D; and Ly. 
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ча The determinant of Pj; is 


i Ј 
1. 
1 
0 “477 1 i 
IPjl = 1 
1 
1 0 J 


Expanding Pj; with respect to the first row, we have a determinant of 
Order n — 1 equal to ЇР, Then expanding the determinant of order n — 1 
with respect to its first row, we obtain a determinant of order n — 2 which 
is also equal to ЇР, It is easy to see that going on with this procedure 
we obtain 


IP] = |1- - 0 


after i — 1 steps. 
Using this procedure further, we finally arrive at 


0 1 


Exp t 


ЇР, = 








Recall that D; is a diagonal matrix of order n such that aj = 6 and 
the other diagonal elements are unities. Then 


ID; = £. 
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For the matrix Li; we have 
197 = 1. > 


Properties of determinants. Chapter 1 represents a detailed account of 
properties of the second- and third-order determinants. Here we point out 
properties possessed by an arbitrary determinant of order n. 

Linear property. Suppose that we are given a determinant D such that 
one of its rows is a linear combination of the other two rows. Let 


Qt) Q12 Qin 
D-|Mitnuyy №: + шр... Аж итп | і. 
Ant An2 Qnn 


Then 
D = XD’ + aD”, 


where the determinants 


оп 012 Qin Qj1 12 Qin 
D'- fi Bo Bn 1 and D” = yı үү Үп |! 
Оп] On2 Qnn Оп] On2 Onn 


differ from D by their ith rows. 

*4 To verify this property it suffices to expand D, D’ and D” with respect 
to their ith rows. Observe that the cofactors of №; + нуу, Bj and гү, are 
all equal to Ajj. Then using (5.37) we have 1 


D= Ў) (8j + рү)Аџ, 


j=l 
D’ = У] Ау and D" = }, yA. 
j=l 1-1 


Whence 
D == AD' + aD”. » 


Property of antisymmetry. If a determinant D is obtained from a deter- 
minant D by interchanging any two rows in D, then 


Б = -D. 


9—9505 
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<~ Suppose that 


21 022 O2n 

5 Q1 12 Qin 
D => 

Oni QOn2 Onn 


O11 O12 fad Qin 

O21 922 Q2n 
D= 

Qn Q2 Qnn 


by interchanging the first and second rows in D. 
Expand D with respect to the second row and D with respect to the 
first row. Using (5.35), we obtain 


D = —ox M» + а2М +... + (— 1)? + "сл Man 
and 


D = a231M3i — «М» +... + (D! + "oos Mos. 
Whence 
D--D» 


Similarly we may prove this property in the case when D is obtained 
from D by interchanging any two rows. 

Transposition of a determinant. The operation of transposing a matrix 
A leaves the determinant of A unchanged, i.e. 


IA’! = IAI. 


ча Expanding ІА! with respect to the first row and |A'| with respect to 
the first column, we infer that |A’! = JAI. > 

Notice that this property implies the equivalence of rows and columns 
of any determinant. So the above two properties may be described in terms 
of columns of determinants. - 

Multiplication of determinants. Let А and B be two square matrices 
of the same order. Then 


I ABI = lAI- IBI, 


that is, the determinant of the product of two square matrices of the same 
order is equal to the product of the respective determinants. 

Now we shall mention some properties of determinants which are help- 
ful in computing determinants. 

(a) The determinant with two identical rows is equal to zero. 
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+4 Indeed, by property of antisymmetry interchanging any two rows 
reverses the sign of determinant. However, interchanging two identical rows 
leaves the determinant unchanged. Hence D = —D. This means that 
D = 0. > 

(b) Multiplying a row by a scalar is equivalent to multiplying the deter- 
minant by the same scalar. 
ча This follows from linear property provided that д = 0. > 

(c) The determinant with a zero row is equal to zero. 
ча To prove this property it suffices to expand the determinant with respect 
to the zero row. Pr 

(d) The determinant with two proportional rows is equal to zero. 
ча By property (b) the common factor may be taken outside the deter- 
minant. Then two rows become identical. Hence the determinant is equal 
to zero. > 

(e) Adding one row multiplied by a scalar to another row leaves the 
determinant unchanged. 
ча By linear property this operation results in the determinant equal to 
the sum of two determinants, і.е., the original determinant and that with 
two proportional rows. By property (d) the latter is equal to zero. Therefore 
the determinant remains unchanged. > 

In general a determinant remains unchanged when a linear combination 
of rows is added to any of its rows. The same is true for the columns of 
a determinant. 

Example. Prove that the sum of elements of one row multiplied by 
cofactors of the respective elements of another row is equal to zero. 
*4 Replacing the Ath row by the ith row in 


011 012 Qin 
Qtil Qi2 NM Qin I 

D = рро а c EL m (i zx k) 
Ок Оку  ... Ойл k 
Oni On2 Onn 

we obtain 

011 012 Qin 

2 Qtil Qi2 Qin І 

Da In serve 
Qil Qi2 Qin k 
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Observe that the ith and kth rows in D are identical. By property (a), 
D = 0. Then expanding D with respect to the kth row, we arrive at the 
desired identity 


алд)» GizAk2 +... + ОпАк = 0 (AK). 


(Recall that cofactors of elements in any row do not depend on the values 
of these elements.) » 


Computing determinants. When computing determinants we widely use 
elementary (row and column) operations. It is important to point out that 
being subjected to elementary operations either (a) or (b) or (c) the deter- 
minant either reverses its sign or is multiplied by a scalar or remains un- 
changed, respectively. Therefore the elementary operations are closely 
related to the properties of determinants. 

Example. Compute the determinant of 


2 -5 43 
з 4 174 
А-| 4 -10 83 
-3 2 -5 3 


ча Observe that ai; # 0 and divides only two elements of the first column 
with non-zero remainders. To avoid division of elements of the matrix we 
multiply the second row by — 2, the third row by —1 and the fourth row 
by 2. Then we obtain 


2 -5 4 3 
-6 8 —14 -8 
-4 10 -8 -3 
—6 4 -10 6 


(-2)(-1)21А! = 


Step 1. Adding multiples by 3, 2 and 3 of the first row to the second, 
third and fourth rows, respectively, we obtain 


2- 5 4 3 
0 -7 -2 1 
0 0 0 3 
0 -ll 2 15 


41А| = 


Step 2. To avoid division we multiply the fourth row by —7. Then we 
have 


2 -5 4 3 
0 -7 -2 1 
(-7)41Al = 0 0 0 3 
0 77 -14 -105 
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Adding the second row multiplied by 11 to the fourth row gives 


2 -5 4 3 
0 -7 -2 1 
-281А| = 0 0 0 3 


0 0 -36 -94 
Step 3. Interchanging the third and fourth rows gives 


2 -5 4 3 
0-7 -2 1 
281А1= |o о -36 -%4 
0 0 0 3 


Computing the determinant of the triangular matrix given above, we 
obtain 


281Al = 2(—7)(— 36)3 
and finally 
IAI = 54. > 
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A square matrix is called nonsingular if its determinant is not equal 
to zero. 

Let A = (aj) be a nonsingular matrix of order n and let B be a square 
matrix of order n such that the element of B in the position where the. 
ith row and jth column meet is equal to the cofactor Aj; of a,j in A, i.e., 
B = (8j) where @ = Aji. Therefore the matrices A and B are related as 
shown below: 


011 adi Qin 
021 aoi O2n 
ГЭЭ BRR Oe eG а us 1 
«л ehe сл espe Qin J5 
Ол Oni Onn 
i 
Ан Ал Án 
Ai Ар An2 
БЭ eee es 
Aii Aii Ani i 
Ain Ал апп 
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The matrix B possesses the following property 


А!. 0 
AB = BA = сэр -1А14. (5.38) 
0 IAI 
By way of illustration we show that 


AB = IAI-I. 
ча The element of AB occupying the position where the ith row and the 
jth column meet is given by 

n 
yy = 2j QikÁjk. 
к-1 
Setting i = j, we see that гүн is the result of the expansion of the deter- 

minant of A with respect to the ith row, i.e., 


n 
үй = >) aixAix = IAI. 
k=1 


In other words, each diagonal element of AB is equal to the value of the 
determinant of A. 
By the exercise considered in the previous section, we have 
yy = 0 


for each off-diagonal element in АВ. > 
Similar reasoning is applicable to the identity BA = IAI-I. 

















The matrix 

Ап Ал Ani 

[АІ | А | | А | 

; хээ Эр "ET 2 

Sg - 2: li Ji ni 

A T= таг" [AT [Al ГА | G3?) 

Ain Ain Ann 

lA! | А | lA! 


is called the inverse of the matrix A. 
From (5.38) it follows that 


АА! =1, A^"!A-IL (5.40) 


This means that the matrix А ^ ! may be considered as а solution satisfy- 
ing two matrix equations simultaneously 


АХ =I and XA-I, 
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where 
xu Xij Xin 
X = Xil Xij Xin 
Хп Xnj Xnn 


is an unknown matrix. 

Let us show that A ^! is the only solution satisfying both these equa- 
tions simultaneously. 

Assume that there exists a matrix C such that 


АС -1 апа CA-I. 
Pre-multiplying the former identity by A ^! and post-multiplying the 
latter by А !, we have 


А (АС) = (A^ )) and (CA)A '! = (А). (5.41) 
By the associative law of multiplication we may write 


А (АС) = (A^ !A)C and (СА)А ^! = C(AA7?). 
Substituting these identities into (5.41) and using (5.40), we obtain 
IC- A^'I and CI = IA ^. 
Whence, (5.6) gives 
С=А-!. 


Computing inverse matrices. We shall mention here ап effective апа. 
rather straightforward method for computing an inverse matrix by succes- 
sive use of elementary (row and column) operations. We begin with theoret- 
ical background of the method, known as pivotal condensation. 

Theorem 5.4. By a sequence of elementary row (column) operations an 
arbitrary nonsingular matrix is reducible to the identity matrix. 
ча By Theorem 5.1 any matrix is reducible by elementary row operations 
to a matrix of schematic form. 

Suppose that we are given a square nonsingular matrix. 

Let us show that this matrix may be reduced to the triangular matrix 


af? af? o? af 
0 «2 a aR |, (5.42) 
0 0 a ... af? 
0 0 0 ... а 


where af? ж 0, a 4 0, af? 5 0, ..., a® 4 0 (see formula (5.15)). 
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By Theorem 5.2 each elementary row operation is equivalent to pre- 
multiplication of the given matrix by a suitable elementary matrix. Recall 
that the determinants of the elementary matrices are not equal to zero. 
We have also seen that the determinant of the product of two square ma- 
trices is equal to the product of the determinants of these matrices. Thus 
multiplying a nonsingular matrix by any elementary matrix and, conse- 
quently, any elementary row operation, gives rise to a nonsingular matrix. 
This means that if the matrix (5.42) is obtained from the given nonsingular 
matrix by applying a sequence of elementary row operations, (5.42) is non- 
singular and, consequently, its determinant 


aP a-a.. aR 
is distinct from zero. Whence we infer that 
af? #0, af #0, af #0, ..., a z 0. 


If (5.42) is subjected to a sequence of elementary row operations of 
type (b), it becomes 


1 ёр баз ... бал 

0 1 G23  ... боп : (5.43) 

0 0 1 3n 

0 0 0 1 

Adding multiplies of the nth row of (5.43) by — ài, —Gan, ..., 

— &n 1,5 to the Ist, 2nd, ..., (n — 1)th rows, respectively, we obtain the 
matrix 

1 ёр баз бїл-1 0 

0 1 823 бол-1 0 

0 0 1 А3, п- 1 0 , 

0 0 0 1 0 

0 0 0 0 1 


such that in its mth column all the elements but the last one, which is equal 
to 1, are zeros. 

Similarly, adding multiplies of the (n — 1)th row of the obtained matrix 
Бу-бїл-1» -ӨО2,-16 ...› ~@n-2,n-1 to the first n — 2 rows, we obtain 
the matrix such that its (n — 1)th and nth columns contain unities in the 
principal diagonal and zeros in the other rows. In the end of this process 
we add the multiple by — ài» of the second row of the matrix obtained 
after n — 1 steps to the first row and finally arrive at the identity matrix 
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1 0 


9 1 


and thus complete the proof. > 

In matrix form this theorem states the following. 

Theorem 5.5. Let A be an arbitrary nonsingular matrix. Then there exist 
elementary matrices Qi, Qo, ..., Ок such that 


Q.-Qrz-i....Q2QrA =I. (5.44) 
Post-multiplying (5.44) by A ^!, we obtain 
ОсОск-:-..--0::0: = AL 


Pivotal condensation. Suppose that we are given a nonsingular matrix 
A of order л. Let 


(AID) (5.45) 


be a matrix of order п x 2n. If (5.45) is subjected to a sequence of elemen- 
tary row operations or, equivalently is premultiplied by a sequence of 
elementary matrices, the matrix A reduces to the identity matrix and the 
identity matrix I reduces to the matrix A ^ ! inverse to A. Thus, by elemen- 
tary row operations (5.45) is reduced to 


(ПА 2). 
It is easy to understand how we should proceed to compute the inverse ` 


of an arbitrary nonsingular square matrix А = (aj) (i, j = 1, 2, ..., ny 
1. Set up the matrix 


a1) бїз... Ол 1 0 0 
Que c d 
Ол Сїл2 Qnn 0 0 1 


of order n x 2n. 
2. By elementary row operations reduce (A |I) to the form 


1 ёо ... Gin Bu Вә . Ва 
ams ратор, 
0 0 1 Bn Bn2 Bnn 


where A is the triangular matrix of order n obtained from A. (See the proce- 
dure used to prove Theorem 5.1.) 
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3. By elementary row operations reduce (Ã |B) to the form 


1 0 0 үп y2 Yin 
11504 В ee s E, 
0 0 1 “Үл! Үп2 Ynn 


where I is the identity matrix obtained from A. (See the procedure employed 
in Theorem 5.4.) 
4. The matrix С = (yy) of order n (i, j = 1, 2, ..., n) in (1С) is the 
inverse of the matrix A, i.e., 
С-АС 
Example. Compute the matrix inverse to 
1 1 1 1 


1 l -1 -1 
A=] ea 1 | 
1—1 -1 1 
< Set up the matrix of order 4 х8 
| 1 1 I[i 0 0 0 
ce Б өзер 2511 0221-20-20 
В-| 1-1 1 -1]001 0 
1-1 -1 1/0 001 


Step I. Subtract the first row from the other three rows: 
1 1 1 1 1000 

0 0 -2 -2] -1 1 0 0 

Be Berg xc o2. 0 1:0 

0 -2 -2 01-01 00 1 


Step 2. Observe that a22 = 0. Interchange the second and third rows 
and subtract the second row thus obtained from the fourth row: 


1 1 1 1 1 0 0 0 
0 -2 0 -2|-10 1 0 
0 0 -2 -2]|-1 1 0 0 
0 0 -2 2 00 -1 1 


В; > В, = 


Step 3. Subtract the third row from the fourth one and divide each row 
by its diagonal element: 


1111 1 0 0 0 
gRom-[9191|12 0 -1⁄2 0 
2 37100 1 141722 -172 0 0 

0001/14 1/4 -1/4 1/4 
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Step 4. Subtract the fourth row from the other three rows: 


11 10|3/4 1⁄4 1/4 -1/4 
pBom-.19199|v4 1/4 -1⁄4 -1⁄4 
LipC: 0 0 0 | 1/4 -1/4 1/4 -1/4 

0 0 1 | 1⁄4 1⁄4 -1/4 1/4 


Step 5. Subtract the thir 


e 


Step 6. Subtract the second row from the first one: 


100 VA 1⁄4 1⁄4 1⁄4 
0 10 

Bs=Be=| 9 о 1 
000 


1/4 1/4 -1/4 -1/4 
1/4 -1/4 1/4 -1/4 
Whence we conclude that 
1 


1/4 -1/4 -1/4 1/4 
4 


5.4 Rank of a Matrix 


Suppose that we are given the matrix 


row from the first one: 


0 | 1/2 1/2 0 0 
0 | 1/4 1/4 -1/4 -1/4 
0| 1/4 -1⁄4 1/4 -1/4 
1 | 1/4 1/4 -1/4 1/4 


or coo а of 


оо о ~ 
Dore 


~ о о о 


А! =- А. > 


Q11 Q12 Qin 

A= 21 022 O2n 

Om! Om2 Omn 
Let us choose in A k rows and k columns such that i < h < ... < ik and 
Л Л... < Jk refer to the chosen rows and columns, respectively, and 


set up the kth order matrix 


“ил «ил 7 5 О; jk 
А = ij Әј --- Qj 
Gi, ji &i, j Qi, j 


The determinant Mx of this matrix is called the kth order minor of 
the matrix A. 

It is easy to see that an m x n matrix has minors of orders 1, 2, ..., 
min (m, n). 
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Example. We consider an 11 x 14 matrix shown in the upper part of 
Fig. 5.6. Let us choose 7 rows and 7 columns, say, the rows numbered 1, 
3, 4, 6, 8, 9, 10 and the columns numbered 2, 5, 6, 7, 10, 12, 13. If we 
set up a matrix of order 7 comprising the elements which are contained 
in both the rows and columns chosen, we get the matrix shown in the lower 
part of Fig. 5.6 provided that the mutual arrangement of these elements 
in the original matrix is preserved in the new one. The determinant of the 
matrix thus obtained is the 7th order minor of the original matrix. 

Let A be a nonzero matrix. Then 







2 567 10 — 1213 there exists a number r such that 
|а E (a) at least one rth order minor 
7 7 of A is distinct from zero and 
à 222 | (b) any sth order minor of A 

, 5 (s > г), if it exists, is equal to zero. 
9 22222 The number r is called the rank 
8 2 РА p of the matrix A. 
9 2 27 In symbols, we denote the rank 
| Үд 2 of A by r(A). 
Obviously 
| 0 < r(A) < min(m, n). 


The minor M, which is distinct 
from zero and such that its order is 
equal to the rank of the matrix A 
is called the base minor of A. The 
77 17 rows and columns of A comprising 

the elements of the base minor of 

am A are called the base rows and the 
base columns of A. 

Fig. 5.6 Theorem 5.6. Let A be an ar- 
bitrary matrix. Then 

(1) The base rows (columns) of A are linearly independent. 

(2) Each row (column) of A admits a representation in the form of a 
linear combination of the base rows (columns) of A. 

-4 For definiteness assume that the base minor of A is of order r and 
is in the top left corner of the matrix A, i.c., 





ZZ 


САА 





1, 1 Qin 
Gr,rc1 Ойт 
A= . (5.46) 
Qr+ 1,1 ... Gr l,r Gre 1, г+ 1 Qr 1,n 


Оті ... Omr Om,r-1 Qmn 


5.4 Rank of a Matrix 141 


Then the first r rows of A denoted Бу a), аз, ..., a, become the base rows 
of A. 

We shall prove that ац, a2, ..., а, are linearly independent. Let us sup- 
pose the opposite, i.e., the rows a1, 32, ..., a, are linearly dependent. Then 
one of the rows is a linear combination of the other r — 1 rows. Assume 
that 

а, = ха + X232 +... NL 18; 1. 


This means that the rth row of the base minor M, is a linear combination 
of the other rows of M,. Recall that any determinant such that one of its 
rows is a linear combination of the others is equal to zero. Hence, М, = 0. 
But this contradicts the definition of the base minor. Consequently, the 
assumption on the linear dependence of aj, аз, ..., а, is false. Thus we 
infer that aj, a2, ..., а, are linearly independent. 

Now we show that each row of A can be represented as a linear combi- 
nation of the other base rows of A. First, we ascertain that for any i and 
JQ <i<m, 1 <JjK< n) there holds 








O11 Aes Gir ay 

A= 23 -0. (5.47) 
(8121 PvE ағ ar 
| ал t ues Gir Qij 


Indeed, if i < r Acontains two identical rows, and if j < r А contains 
two identical columns. Hence, A = 0 in both these cases. When i > r and 
/ > т, А becomes the (г + 1)th order minor of the matrix A. 

Let us fix a subscript i (1 < i < т), the ith row, and expand the deter- 
minant A with respect to the last column. We get 


оду + agjA2 +... + aA, + aijM, = 0. (5.48) 


The identity (5.48) holds for any j (1 € j < n) and Ai, A, ..., A are in- 
dependent of /. 


Setting 
we may write (5.48) in the form 
ay = Moi + Maz +... + ray, ј = 1, 2, ..., п, 
ог 
ап = Ma + Maz +... + Хол 
anr = Mei + X022 +... + Mam. (5.49) 
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From (5.49) it follows that 
а; = Ма + №а +... + X8. 


Whence we conclude that the ith row of A is a linear combination of the 
base rows аг, аг, ..., ar. Since the above reasoning is applicable to any 
other row of A we infer that each row of A is a linear combination of 
the base rows in А. > 

Proposition. If a matrix is subjected to elementary operations its rank 

does not increase. 
4 Let A be a matrix of rank F obtained from a matrix A of rank r by 
interchanging its rows. We consider an arbitrary minor М, of order s in 
A and choose in A a minor M; of the same order s, such that the elements 
of M; are arranged in the rows and columns of A in the same manner 
as the rows and columns of A comprising the elements of М,. Observe 
that interchanging the rows of A rearranges the rows without altering them. 
This means that the rows in М, may only differ from those in М, by their 
arrangements. Whence it follows that either М, = M, or M, = —M,. 

By definition of the rank all minors of A whose orders exceed r are 
equal to zero. From the above consideration we infer that any sth (s > r) 
order minor M, of A is equal to zero, i.e., М, = 0. This means that the 
rank of A cannot exceed the rank of A, ie, F € л. 

Similarly, we may show that F < г in the cases when the matrix A is 
obtained from A by elementary operations of the other two types. Obvious- 
ly, this assertion is true for elementary column operations. 9» 

Theorem 5.7. If an arbitrary matrix A is subjected to elementary row 

(column) operations its rank remains unchanged. 
“4 It suffices to recall that if the matrix A is obtained from the matrix 
A by elementary operations, the matrix A is obtainable from A by elemen- 
tary operations of the same type. Taking into account the above assertion, 
we conclude that r < F. 

Comparing the inequalities ғ < r and r < F, we arrive at the desired 
identity 


r=r. > 


Remark. The number of nonzero rows in a matrix of schematic form 
is equal to the rank of the matrix. 

Indeed, for a matrix of schematic form the rth order minor whose ele- 
ments are contained in the first г rows and in the columns Ki, ёо, ..., 
К, is distinct from zero, 1.е., 
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0 
ай, = af? a, ... a £0 


0 | D 


Qrk, 


and any minor of order s (s > r) comprises the zero row and, consequently, 
is equal to zero. 

Thus, subjecting a matrix to a sequence of elementary operations, we 
may easily and effectively define the rank of this matrix. 

Example. Find the rank of 


2 13 -24 
A=(4 -25 171. 
2-11 82 


ча Step 1, Subtracting the multiples of the first row by 2 and 1 from the 
second and third rows, respectively, we get 


Qc. d$ 1 
rA) -r(o 0|-1 5 -1]. 
0 0|-2 10 -2 


Step 2. Subtracting the second row multiplied by 2 from the third row, 
we get 


2 -1 3 -2 4 
rA)-r[(o ol-1 s -1)=2. » 
о о 0 0 0 
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Definitions. Suppose that we are given a matrix 


оп Qa ... Qi 0 

2 Q21 Q22 ers [^5 

Abe сыш: юма! (5.50) 
Omi Am2 Отп Bm 


whose first n columns are nonzero. 
A collection of relations of the form 
ахі + 2X2 +... + олпХа = ĝi, 
021X2 + a22X%2 +... + OonXn = В, (5.51) 


OmiXi + Om2X2 +... + OmnXn = Вт, 
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where Xi, Хо, ..., X, are quantities to be defined (unknowns), is called 
the system of m linear equations in n unknowns or the linear system. 

The numbers a; (i = 1,2, ...,m;j = 1,2, ..., п) are called the coeffi- 
cients of the linear system (5.51) and the numbers 8; (i = 1, 2, ..., m) are 
called the constant or free terms. 

A solution of the linear system (5.51) is an ordered n-tuple of numbers 
Y1» Y2> +++) Yn Which, being substituted into (5.51) for the unknowns xi, 
X2, ..., Ха, turns each equation of (5.51) into identity. 

The system of linear equations is said to be consistent if it has a solution 
and inconsistent if it has no solutions. 

Two solutions yi, үг, ..., ya and yi, yz, ..., yn are called distinct 
if at least one of the identities y: = үү, Y2 = yi, ..., Yn = Yn is violated. 

A consistent system is called determinate if it has a unique solution, 
and indeterminate if it has at least two distinct solutions. 

In matrix form, the system (5.91) may be written as the matrix equation 


AX - b, (5.52) 
where 
Oi O12 Qin 
А = 021 922 QO2n : 
a a DM 
(5.53) 
Bi x 
b- Pa and X= 28 
Bm х, 


The matrix А is called the coefficient matrix of the system (5.51), the 
column-vector b is called the column-vector of constant terms and the 
column-vector X is called the column-vector of unknowns. The matrix 


A = (Alb) 


is called the augmented matrix of the system (5.51). The solution of the 
matrix equation (5.52) is the column-vector 


yı 
r-[ 7 


Yn 
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Theorem 5.8 (Kronecker-Capelli theorem). The system of linear equa- 
tions (the linear system) is consistent if and only if the rank of the coeffi- 
cient matrix is equal to the rank of the augmented matrix. 

«~ Suppose that the system (5.51) is consistent. This means that the n-tuple 
of numbers yi, y2, ..., ун turns each equation of (5.51) into identity, that 
is, 

«лүп + олуүз +... + Qinyn = 81 

«зүү: + 022ү +... + O2nyn = B2 


Ат1Ү + Am2y2 + ... + атпүп = Bm 


From the above identities it follows that in the augmented matrix 
A = (Alb) the column-vector b of constant terms is a linear combination 
of the other columns in A, ie., b is a linear combination of the columns 
of A. 

Adding to the column-vector b the first column of A multiplied by — y1, 
the second column of A multiplied by — y2, etc., and finally the nth column 
of A multiplied by — yn, we reduce the augmented matrix A to the matrix 


A = (А 10). 


The rank of A is equal to that of A since, by Theorem 5.7, the elemen- 
tary column operations leave the rank unchanged. On the other hand, it 
is clear that the rank of A = (А 10) is equal to that of A. Whence we infer 
that r(A) = r(A) = r(A). 

Now we suppose that the rank of the coefficient matrix is equal to that 
of the augmented matrix. 

Since A = (Alb), the matrices A and A have common base minors. 
Let the common base minor be of order r and let this base minor be in 
the top left corners of the respective matrices. 

By Theorem 5.6 any column in A admits a representation in the form 
of a linear combination of the base columns of A. In particular, the 
column-vector of constant terms, i.e, the (n + 1)th column in Á, may be 
written as 


Qj 012 Qir Bi 
021 022 02, B 
yi . + Үү i Hrab tyy . = : 
Оті Qm2 Omr Bm 
Or 
«зү: + ayz +... + airy, = 81, 


«зү + 2272 +... + Оогүг Вэ, 


Ат1Үү1 + Am2y2 + ... + атгүг = Bm. 


10—9505 
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It is easy to see that the ordered n-tuple of numbers 
ҮІ, Ү2, ---» Yrs 0, ..., 0 


turns each equation of the original system into identity. Whence it follows 
that the system (5.51) is consistent. » 

Equivalence of linear systems. We shall call a collection of all solutions 
of a given linear system a set of solutions. 

Two linear systems comprising equal numbers of unknowns are said 
to be equivalent if their sets of solutions coincide. In other words, two 
linear systems are equivalent if each solution of one system is a solution 
of the other one and vice versa, or if both these systems have no solutions. 

It is clear that any linear system is uniquely determined by its augmented 
matrix. Suppose that A and A’ are two m x (n + 1) augmented matrices 
of the linear systems 


aX + ой? +... + QinXn = Bi, 
G&z1X1 + 00202 +... + OanXn = 82, (*) 
Ат1Х1 + Om2X2 ^u. OmnXn = Bm 

and 
«ї1Х1 + 2X2 +... + O fn Xn = Bi, 
ах + 022X2 +... + «з„Хл = BI, (*’) 
OmiXi + оро +... + OmnXn = Bm- 


The system (*’) is said to be obtained from the system (ж) by elementary 
operations if the augmented matrix A’ is obtained from the augmented 
matrix A by elementary row operations. 

Theorem 5.9. If the linear system (*') is obtained from the linear system 
(9) by elementary operations these systems are equivalent. 
ча First, we suppose that the system (*) is consistent. Let y1, yo, ..., Yn 
be a solution of (*). We show that this n-tuple of numbers turns each equa- 
tion of (»') into identity. Consider three cases which correspond to elemen- 
tary operations of different kinds: 

(a) Assume that the system (*') is obtained from (+) by interchanging 
rows in A. This means that the matrix A’ comprises the same collection 
of rows as A and, consequently, the system (*’) contains the same collec- 
tion of equations as (+). Whence we conclude that if the n-tuple y1, yz, 

. +» Yn turns each equation of (*) into identity, it also turns each equation 
of (*’) into identity. 

(b) Assume that the system (*’) is obtained from (ж) by multiplying 
the kth row in A by a nonzero scalar №. Then the Ath equation of (97) 
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will be of the form 
(Лак)  (Xax2)X2 +... + Aten) Xn = Bx 


and the other (m — 1) equations will be the same as in (*). 
Substituting yi, Y2, ..., Үл into the Ath equation of (x), we obtain the 
identity 


ак1ү + Ok2'y2 +... + Okn'yn = Br. 
Multiplying this identity by А # 0, we arrive at 
(Аек): + Q2) ya Shy cece (Aan) Yn = Abk. 


Whence we conclude that the solution of (*) is also the solution of (*’) 
since yi, y2, ..., Үл turns the Ath and all other equations of (*') into 
identities. 

(c) Now we assume that the system (*') is obtained by adding to the 
ith row in A the kth row multiplied by a scalar д. Then the /th equation 
of (57) will be of the form 


(оті + noki)xi + (om + рок) 
2 F (Qin + BOtkn) Xn = 8: + ибк (5.54) 


and the other (m — 1) equations in (*’) will be the same as in (=). 
Substituting Үл, Y2, ..., Үл into the Ath and /th equations of (ж), we 
obtain two identities 


«күү + Ok2'y2 + ... + Okn'Yn = Вк, 
any. + а2%ү2 + ... + ОлҮл = Br. 


Substituting the multiple of the first identity by » into the second one, 
we obtain 


(an + uoui)yi + (02 + pork) "үг 
+... + (Qin + рокп)үп = Bi + pbk. 


Whence we infer that the solution y1, Y2, ..., Үл of (ж) is also the solu- 
tion of («’). Notice that by substituting у, yz, ..., Үл into (5.54), we reach 
the same result. 

Thus, in each case any solution of the system (*) is a solution of the 
system (*'). 

Observe that the system (ж) may also be obtained from the system (*’) 
by elementary operations. Repeating the previous arguments, we infer that 
any solution of the system (*') is a solution of the system (x). 

Second, we suppose that the system (ж) is inconsistent. Then the system 
(*’) 15 also inconsistent. To prove this fact, assume the opposite, i.e., assume 
that the system (* ') is consistent. Then Theorem 5.9 implies that the system 


10" 


148 5. Matrices. Determinants. Systems of Linear Equations 


(*) is also consistent. However, this contradicts to what we have assumed. 
Whence we conclude that if the system (*) is inconsistent, so is the system 
(*'). » 

Remark. Clearly, if the system (*’) is obtained from the system (ж) by 
a finite sequence of elementary operations, then these systems are 
equivalent. . 

Gaussian elimination. When we wish to solve a linear system we should, 
first, find out whether the system is consistent, and if this is the case, 
second, find the set of its solutions. 

We shall outline a method of solving a given linear system called the 
method of Gaussian elimination. The method employs elementary opera- 
tions to reduce a given linear system to a system of simpler form whose 
solutions are easily computed. 

The elementary operations on a linear system are equivalent to the 
elementary row operations on its augmented matrix. So we shall simultane- 
ously consider the linear system 


О11Х) + 0120 +... + оллХл = Ві, 
021X1 + 022X2 +... + Q2nXn = B2, (ж) 
Omi1iXi + Am2X2 c* vii QmnXn = Bm 


11 012 Gin B 1 
А оо] az Ол 8: 
A= 

Omi Om2 Omn Üm 


In the previous sections we have shown that the elementary row opera- 
tions may reduce the augmented matrix À to schematic form 


af) 
aR, | 


А’ = aR 


В) \ 
0 


0 


Similarly, the system (*) is reduced to the form with the augmented 
matrix A. 
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If the constant term 82, is distinct from zero the new linear system 
(and, consequently, the original one) is inconsistent. Indeed, the (r + 1)th 
equation takes the form 


0-xy + 0-0 +... + Om = BO, 40, 
This means that there exists no n-tuple of numbers turning the above equa- 
tion into identity. 
Let 892, = 0. Then only the first г rows in A’ are distinct from zero. 
To simplify the notation we set 


k = 2, Кз = 3, 2 К = ГР. 


(This may be made by renumbering yi = xi, Y2 = Xk.) ..., Yr = Хк, ....) 
The linear system takes the form 


ax + ах +... + ах, +... ox, = Be, 


ax +... + ах, +... + ox, = BR, (*’) 
aO x, Fesat aR x, = B®, 


where a(? x 0, «2 2 0, ..., af? #0. Then 
(1) The number п of unknowns in (*’) is equal to the number r of 
equations, ie, г = n. Then the system (*’) takes the form 


ax + ах: +... 84 ee ee + ox, = 89), 
ax; +... + a1 %n-1 Ха = BY, 


-1 -1 -1 
aa ee ee + a Me = Bp. 


aO x, = Be”. 


where of #0, k 21, 2, .... 

The last equation defines uniquely the value of x,. Substituting the 
value of x, into the (n — 1)th equation, we get the value of хп і. Then 
substituting the values of x, and x,- ; into the (n — 2)th equation, we ob- 
tain the value of x,-2. Continuing this process, we obtain the values of 
Xn -35 Xn-4, ...› Хз, X2. Finally, substituting the values of Хэ, Xs, ..., Xn 
into the first equation we obtain the value of x;. 

Thus the system (*’) has a unique solution and so does the system (*). 

(2) The number n of unknowns in (*') is greater than the number r 
of equations, ie, n > r. 

We set the unknowns Xr+1, Xr«2, ..., Xn equal to arbitrary numbers 
^fr 1» Yr 25 +++) Үл, Then transposing the corresponding terms to the right, 
we have the system 
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1 1 1 
af? x + ах: Tec af x, E gp» = а. 'Yyre1 — eee — os, 
2 2 
«® +... + ах, = BP — a La ун... — as, 
«Рх = BP -afai Yrer e m отуп. 


Similarly to the case (1) we obtain the values of x,, Xr-1, ..., X2, X1. 
Since yc. i1, Yr+2s ---» Yn are arbitrary numbers we infer that the system 
has infinitely many solutions. > 

Example. Solve the system 


3x1 — 5x2 + 2x3 + 4x4 = 2, 
7x — 40 + хз + 3x4 = 5, 
5x1 + 7x2 — 4x3 — 64 = 3. 


за [et us arrange the augmented matrix 
а 3 -5 2 4|2 
A=[7 -4 1 3 |5 
5 7 -4 -6] 3 
and reduce A to schematic form. 
Step 1. To get the matrix with ад = 1 we subtract the first row multi- 


plied by 2 from the second row and interchange the first row and the ob- 
tained second one. We have 


А 1 6 -3 —5 |1 

А-13 -5 2 4 12 

5 7 -4 -613 
Subtracting the multiples of the first row by 3 and 5 from the second 


and third rows, respectively, we get 
1 6 -3 -5 1 
о -23 п 19| -1 
01-23 И 19 -2 

Step 2. Subtracting the second row from' the third one, we have 


1 6 -3 -5 1 
0|-23 11 19 | -1 
0 0 0 0| -1 


Since the rank of the coefficient matrix is equal to 2 and the rank of 
the augmented matrix is equal to 3 we conclude that the linear system is 
inconsistent. > 
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Example. Solve the system 


2х1 + 5x; — 8х3 8, 
4x1 + 3x2 — 9x3 9, 
2x1 + 3x2 — 5x3 = 7, 
хі + 8x2 — 7x3 12. 


ча Arrange the augmented matrix 


2.5 -8 8 
43 -9 9 
23 -5 7 
1 8 -7 12 


> 


Step 1. ps the first and fourth rows, we have 


: Е 12 
9 
7 
8 


Subtracting the multiples of the first row by 4, 2 and 2 from the second, 
third and fourth rows, respectively, we get 


E 12 
Не -39 
-13 -17 
-1 —16 


Step 2. To simplify computations we subtract the third row multiplied 
by 2 from the second row and then the fourth row from the third one. 
We have 


1 р -7 12 
Н 1 -5 
Н 2 3 -1 
01 -1 6 - 16 
Subtracting the third row from the second one, we get 
1 8 -7 12 
0| -1 -2 -4 
0| -2 3 -1 
01-1 6 —16 


Subtracting the multiples of the second rows by 2 and 11 from the third 
and fourth rows, respectively, we obtain 
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1 8 —7 12 

0| -1 -2 | -4 

0 0 7 7 

0 0| 28 28 


Step 3. Subtracting the third row multiplied by 4 from the fourth row, 
we have 





The system is consistent since the rank of the coefficient matrix is equal 
to the rank of the augmented matrix, that is, r(A) = r(A) = 3. Since the 
rank is equal to the number of unknowns the system has a unique solution. 

Thus the original system is equivalent to the system 

ху + 8x2 – 7x3 = 12, 
— 2-24 = —4, 
X3 = 1. 

From the third equation we have x; = 1. Substituting x; = | into the 
second equation, we have —х› — 2 = —4. Whence, x; = 2. Substituting 
the values of хэ and хз into the first equation, we get xy + 16 — 7 = 12. 
Whence x; = 3. Thus the system has the unique solution x; = 3, x? = 2, 
ху = 1. > 

Example. Solve the system 

3x, — 20 + 5x3 + 434 = 2, 
бху — 4x2 + 4x3 + 324 = 3, 
9x, — 6X2 + 3x3 + 2x4 = 4. 


ча Arrange the augmented matrix 
3 -2 5 4 2 
6 -4 4 3 3 |. 
9 -6 3 214 


Step 1, Subtracting the multiples of the first row by 2 and 3 from the 
second and third rows, respectively, we have 
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3 -2 5 4 2 
0 0|-6 -5 -1]. 
0 01-12 -10 = 


Step 2. Subtracting the multiple of the second row by 2 from the third 
row, we get 





The system is consistent (r(A) = r(A) = 2) and has infinitely many so- 
lutions since the rank of the coefficient matrix is smaller than the number 
of unknowns (r(A) < 4). 

The original system is equivalent to the system 

3x, — 2x2 + 5x3 + 44 = 2, 
— 6x; — 5x4 = - 1. 
Let us find the solution of this system. 


Set x; and x4 equal to arbitrary numbers y; and y4. Then transposing 
the corresponding terms of the system to the right, we have 


3x, + 5x3 = 2 + 272 — 4y4, 
6x3 = 1 — 5+. 


From the last equation we get 
1 
x =z (l — 59) 


where y4 is an arbitrary number. 
Substituting the expression for хз into the first equation, we obtain 


x = (7+ 12y + и), 


where y? and y4 are arbitrary numbers. 
The general solution of the system is 


х= iunc) xo ys ail- Sy) xe 


Any specific solution is obtainable from the general one by giving 
specific values to y2 and y4. For instance, if we set xo = y2 = 1 and X4 = 
ya = —1, we get x, = хз = 1. Thus the specific solution of the system is 


х= 1, x= l, Хз = l, x4 = —1. > 


154 5. Matrices. Determinants. Systems of Linear Equations 


Cramer's rule. Suppose we are given a system of n linear equations in 
n unknowns 


ах + 0120 +... + QinXn = 81, 
Q21X1 + 022X2 +... + Q2nXn = ›, (5.55) 
OniXi + OX To... + OnnXn = Bn 


or, in matrix form, 
AX = b. (5.56) 
System (5.55) is called the quadratic system of linear equations. 
If the square matrix A is nonsingular the system (5.55) is consistent 


and has a unique solution since r(A) = n. 
Pre-multiplying (5.56) by the matrix A ^ 


X-A'!b 


1 inverse to A, we have 


or, using formula (5.39), 


xi Ап А ... Am В: 
х _ 1 [ ár Ao ... Am В 
: VAN Us usu cem : 
Xn Ain Aan Ann Bn 


After a little algebra in the right-hand side of the above expression, we 
have 


г, Bkk /=1,2,...,‚,п 


or, as quotient of determinants, 


J 
O11 Qi2... By Qin 
оол 022 82 соп 
Oni On2 Bn Onn 
Xj = , J = 1, 2, » n, 
Q1 912 ay Qin 
021 022 o2j O2n 
Onl Qn2 Оп] Onn 


(5.57) 


5.5 Systems of Linear Equations 155 


where the numerator is the determinant of the matrix obtained from A 
by substituting the column-vector b for the jth column and the denominator 
is the determinant of the coefficient matrix A. 

It is important to mention that Cramer's rule (5.57) is mainly of theoret- 
ical interest and is of no use for practical purposes it being awkward to 
solve linear systems except those in two or three unknowns. 

Remark. Cramer’s rule involves n + 1 determinants of order n thus re- 
quiring a vast amount of direct computations which is proportional to n!n 
and far exceeds the number of arithmetic operations required by the Gaus- 
sian elimination. When n — 30 the task of computing the solution of the 
linear system becomes time consuming even for advanced computers. The 
total number of arithmetic operations involved in Gaussian elimination is 
proportional to л”. 

To solve the quadratic linear system 


AX = b, 


where A is a nonsingular square matrix, we should proceed as follows: 
Step 1. Arrange the augmented matrix 


Qj O2 ... Ол 8: 
9 о о e. Q 
A=(Alpy=( “= =. о р 

Oni An2 ... Onn Bn 


Step 2. By elementary row operations reduce A to the form 


1 12 ee Qin B 
А)=[° 1 >> | Б E 
0 0 ... 1 Bn 


where A is a triangular matrix. 2 
Step 3. By elementary row operations reduce (А ІБ) to the form 


1 о... 0 yı 
Gis 9 1-0], 
0 1 sez Л Yn 


where J is an identity matrix. 
Step 4, Write down the system corresponding to the augmented matrix 
(11е) 


xi = yı 
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The n-tuple 
X1 = yr, X2 = ү, ..., Xn = Yn 


is the solution of the original system. 

Homogeneous linear systems. The system of т linear equations in п 
unknowns is called homogeneous if all constant terms are equal to zero, 
ie, the homogeneous linear system is given by 


О11Х1 + 012% +... + AinXn = 0, 
Q21X1 + 0220 +... + «эһХл = 0, (5.58) 
Om1X1 + ат2Х to... + OmnXn = 0. 


We shall consider some important properties of homogeneous linear 


Systems. 
(a) Any homogeneous system is consistent. 
ча The n-tuple xı = 0, x2 = 0, ..., Xn = 0 is the trivial solution of any 


homogeneous system. > 
(b) If the number 7 of the unknowns exceeds the number m of the equa- 
tions the homogeneous system has nontrivial solutions. 
ч By definition the rank r of the system (5.58) satisfies the inequality 
г< m < n. Whence we infer that the linear system is indeterminate. > 
(c) The sum of solutions of the homogeneous system (5.58) is a solution 
of (5.58). 
* Let yj, Y2, ---» Үл and ү, yz, ..., Үл be two solutions of (5.58). This 
means that 


n n 
> ejy/2.0 and У) аү/= 0 
jz1 itia 


for any i (і = 1, 2, ..., m). 
Since 
n n n 
M ay(yj+ yj) = > aijyj + > ауу = 0 (= 1, ..., т), 
j=l j=l j=l 
the n-tuple 
Yit YO yt Y o. Үп + Үл 


i.e., the sum of the solutions of (5.58), is also the solution of the homogene- 


ous linear system (5.58). > 
(d) The product of the solution of (5.58) by an arbitrary number is also 
a solution of (5.58). 
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< Let yı, Y2, ..., Үл be a solution of (5.58), i.e., 
n 
>) ayy = 0 (= 1, 2, ..., т) ` 
j=1 


and let » be an arbitrary number. 
Then 


n 


n 
У айру) =н 21 оу = 0 (i=l, 2, ..., т). 
1-1 4-1 
Whence we conclude that the n-tuple цул, руг, ..., иүп, i€., the product 
of the solution yı, y, ..., ya by the number y is the solution of the system 
(5.58). > 

In matrix form, the homogeneous linear system may be written as 


O11 O12  ... Qin xi 0 
021 022 леа Qn X2 2 0 
Omi Qm2 Omn Xn 0 


or, as the matrix equation 
AX = 0. (5.59) 


Then the properties (c) and (d) may be formulated as follows. 

(c^) If the column-vectors T'" and T'" are the solutions of the system 
(5.59), so is the column-vector Г’ + T". 
ча Indeed, if Г’ and Г” are the solutions of (5.59), then AI’ = 0, 
АГ” = 0 and А(Г’+Г”) = АГ’ + AT" =0+0=0, so that the 
vector-column Г’ + Г” is the solution of (5.59). > 

(d’) If the column-vector Г is the solution of (5.59) so is Г where н 
is an arbitrary number. 
< Indeed, if Г is the solution of (5.59) then AT = 0 and A(zI) = „(АГ) = 
00 = 0, so that the product of the column-vector Г by an arbitrary number 
р is the solution of (5.59). > 

The properties (c’) and (47) imply that the solutions of the homogene- 
ous linear system with the induced operations of addition and multiplica- 
tion by an arbitrary number form a linear space." Let us acquaint ourselves 
with an important property of this linear space. 

By the method of Gaussian elimination we may reduce the homogene- 
ous linear system (5.58) to the form 


? The notion of linear space will be discussed in Chap.6 
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Xı + Вх +... + Bix, + B, Xr d +... + BinXn = 0, 
Xx+... + B2rXr + B2,r+1Xr+1 +... + BonXn = 0, (5.60) 


Xr t Br,r+1Xr+1 dÉ eb BrnXn = 0. 


Allowing for renumbering of the unknowns we may consider Ху, X2, ..., 
x, as the principal unknowns and the other unknowns as indeterminate ог 
disposal unknowns, 

Let the rank r of the system (5.60) be smaller than the number n of 
the unknowns, that is, r « n. 

We compute n — r basic solutions by giving to the indeterminate 
unknowns X,41, Xra2, ..., Xn the values from the following table 


(5.61) 





Each row in (5.61) determines the solution of the system (5.60) so that 
we have the collection of n —r solutions 


yii “12 \- 
'y21 122 
“үл Yr2 
1 2 0 
Г, 0 , T; x 1 , 
0 0 
0 
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Yl,n-r-1 Yl,n-r 
Y2,n-r-1 Y2,n-r 
EE Г,-,-1 = Yr,ın-r-1 , Ir-r = Yr,n-r 
0 0 
0 0 
1 0 
0 1 


The solutions Гу, Їл, ..., Г„-, are linearly independent. 
~ Consider the linear combination 


Broil duoc Г Мр 


с-1ү11 + Иг-2112- +... + рату, п-г 
Ber +121 + pr + 27ү22 +... + рптү2,п- г 


= Br tyr + pr+2Yr2 +... + рпг, пг . (5.62) 
г-1 
Br+2 
Ёт-1 
Bn 
It is easy to see that the linear combination (5.62) is equal to the zero 
column-vector if and only if ш +} = pr+2 =... = pn-1 = pa = 0. Hence, 


it follows that the zero (trivial) solution of the system (5.60) is equal only 
to the trivial linear combination of the solutions Fi, F2, ..., r-r. P 
The properties (c') and (d’) imply that given arbitrary scalars иг-л, 


Br 2, ..., ил, the linear combination (5.62) is the solution of the system 
(5.60). 
я Let 
pı 
p2 
Г = ш (5.63) 
Ur+1 
Hn 


be an arbitrary solution of (5.60). Then (5.63) may be represented as a linear 
combination of the form (5.62). 
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Multiplying the solutions Ty, Г, ..., Їл-» by usi, pra2, ...› Їл» 
respectively, and adding the multiples obtained, we get the solution of (5.60) 
as the linear combination (5.62). 

Comparing (5.62) and (5.63), we easily see that both comprise the same 
values of the indeterminate unknowns pr+1, ре+2, ---» ил. Observe that 
the values of the indeterminate unknowns uniquely determine the values 
of the principal unknowns. Hence, the solutions (5.62) and (5.63) are identi- 
cal and 


Icgwabl uua +... Гь с. > (5.64) 


Therefore the solutions Гү, Го, ..., Г„-, of the homogeneous system 
(5.58) are such that (a) they are linearly independent and (b) any solution 
of (5.58) may be represented as a linear combination of T1, F2, ..., Г,-,. 

Definition. Any collection of n — r solutions of the homogeneous sys- 
tem (5.58) that satisfy the conditions (a) and (b) given above is called the 
fundamental system of solutions of (5.58). 

Example. Compute the solutions of the system 


3x1 — 2x2 + 2x3 + 4x4 = 0, 
бх — 40 + 4x3 + 3x4 = 0, 
9x; — 6x2 + 3x3 + 2x4 = 0. 


ча By the method of Gaussian elimination we have 


3x, — 2x2 + 5x3 + 4x4 = 0, 
— 6x3 — 5x4 = 0. 


Let us choose x» and x4 as indeterminate unknowns and arrange the 
table 





The fundamental system comprises the solutions 


2 1 
3 0 
Bt ig pol ш 
0 18 
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Any solution of the given linear system can be expressed as 


2 1 

3 0 
Г=џ 0 + yp -15 3 

0 18 


where u and р are arbitrary numbers. В» 

Summing up, it is important to mention that it suffices to find the fun- 
damental system of solutions to describe the set of solutions of the 
homogeneous linear system since the set of all linear combinations of the 
fundamental solutions forms the desired set. It is also easy to see that table 
(5.61) represents the identity matrix of order n — r. However, this is one 
of the many ways to identify values of the indeterminate unknowns. Table 
(5.61) may contain any other nonsingular matrix of order n — r. Finally, 
we emphasize that any homogeneous linear system having nontrivial solu- 
tions possesses the fundamental system of solutions. 


Methods of solving linear systems. In general the methods used to com- 
pute the solutions of linear systems can be classified as direct and indirect 
(or iterative). 

Direct methods are based upon Gaussian elimination with some inessen- 
tial modifications aimed at diminishing the number of operations required 
so that it remains proportional to и?. When input data, i.e., the coefficient 
matrix À and the constant terms in (5.52), are defined exactly and the com- 
putations are free from rounding-off errors, the direct method yields the 
true solution as a result of finite sequence of steps or operations. 

Indirect methods provide the approximate solutions by giving sequences 
of approximate solutions that start from some initial approximation. The 
complete cycle of computing the new approximate solution from the origi- 
nal one is called the iteration. So the indirect methods are often called the 
iterative ones. They involve a finite number of iterations to reach the ap- 
proximate solutions with given precision. 

In what follows we consider two methods, one direct and the other in- 
direct, which are of practical interest in a variety of applications. 

Gaussian elimination for linear systems with tridiagonal matrices. Of 
practical interest are linear systems with tridiagonal coefficient matrices 
which are square matrices A = (aj) whose nonzero elements are adjacent 
to their principal diagonals so that aj = 0 if |i — jl! > 1. 

Let 


bi €1 0 0. 0 0 0 
a2 b; C2 0. 0 0 0 
A= 0 аз by Сз 0 0 0 
0 0 0 0 . @п-1 Dn -1 Сп-1 
0 0 0 0. 0 ап bn 


11—9505 
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be a tridiagonal coefficient matrix. Then the corresponding linear system 
takes the form 


Diti ХЭ iL Bak Where UE Ses OAS ENIM сийн = а, 
aX Dx» HOON i heise Bo BAe Red = d, 
QiXj—1 + Dixi + СХ... = di, 


аһХп-1 + DnXn =d, 
where i = 2, 3, ..., n— 1, or 


diXi-1 + bixi + CiXi41 = di, (5.65) 


where i = 1, 2, ...,n, а = 0, Cn = 0. 
Let us apply the method of Gaussian elimination to (5.65). 
We assume that the unknowns x; are related as 


Xi = Qi+1Xi+1 + Bist (= 1, 2, ..., n), (5.66) 


where oj41 and 8;.; are unknown quantities. 
Diminishing all the subscripts in (5.66) by unity and substituting the 
obtained relations into (5.65), we get 


а: (ох: + Bi) + Bix; + cixi. = di. 
Whence 


Ci aiBi — di 


Xi = - T——— Xi 1 PRESTA 
Е bi + ао; "t 


СЪ + aiai 
This expression is identical to (5.66) if 


Ci 


_ aiBi + di 
bi + aiai 


ИЕ da 


i+. = 
To start the computations we also need to know the values of a1, 1, 
ха +1. These are usually set equal to 


a, = 0, Bi = 0, Xn+1 = 0. 
Collecting all the formulas together, we may describe the method as 
consisting of 
(a) elimination stage 
Ci 


Qi+1 = — y— ——— 
Е bi + aiai ’ 


aif; — di 
bi + ajay; d 


Bis1 a эг, 
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(b) back-substitution stage 
Xi = Qi+1Xi+1 + Bio, P=n,n—1, ..., 1, x41 = 0. 


Therefore on completing the elimination stage we obtain the values of 
the quantities о; + ; and 8B;.1 ( = 1, 2, ..., n) and, consequently, the value 
of the unknown xn. Then substituting x, into the expression relating x, 
and x,-1, we get the value of х„-1. Substituting х, і into the relation 
which comprises x, – ; and х, +2, we get the value of x,-2. Continuing this 
process, we finally compute the values for all the unknowns xi, X2, ..., Xn. 


Method of simple iteration. Suppose that we are given a linear system 
(5.52) and start the computations from some initial approximate solution 
Yo. 

Let Y, and Ү; + be two approximate solutions obtained by completing 
the Ath and (k + 1)th iterations, respectively. Suppose that Yx and Yxi1 
are related as 


Yer Pe Ay, = b (k =0, 1, 2, ...) (5.67) 


or 
Yui = Yk — (AY, — Б), (5.68) 


where т (т > 0) is a real number called the stationary parameter. 

The procedure which employs the recurrence relations (5.67) and (5.68) 
to compute the approximate solution of the system (5.52) is called the 
method of simple iteration. 

In terms of unknowns the relation (5.68) is written as 


уйл = 00 - т ( È af – 8) (712, ..,2. (569) 


The precision of the method is given by the value of the error 
7к= Y- X 
ie, the value of the difference between the solution of (5.68) and the true 
solution X of the system (5.52). 
Let є (€ > 0) be a given relative error used to estimate the accuracy of 


the approximate solution Yx. The process of the computations terminates 
when the following condition is satisfied 


n n 
121 = | У) 1012 <e | Y 1201? = 1201. 
i=1 i=1 


The iterative method is said to converge if 
lim 1241 = 0. 
К- 
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Suppose that the coefficient matrix A = (о;у) of (5.52) is symmetric and 
all the eigenvalues of A are positive, i.e., 0 < »; < № <... < № (see Chap- 
ter 6). 

Theorem 5.10. Whatever the initial approximation Yo the method of 
simple iteration converges if the stationary parameter т satisfies the con- 
dition 

ке. (5.70) 
An 


Example. Given the initial approximation 


ъ= (o) 


and the relative error € = 0.3, use the method of simple iteration to com- 
pute the solution of the linear system 


2x1 + X=l, 
ху + 20 = 2, 


or, in matrix form, 
21 Xi _ 1 
12/\%/ \2/` 
45 Let us compute the eigenvalues of the symmetric matrix 
21 
12/7" 
We have 


2-3Х 1 28 2 EN 
| ‚-›|-®-2 -1-0, 


Whence Мм = 1 and M = 3. 
Put 
1 


т=у = 05. 


Thus condition (5.70) is satisfied. Then from (5.68) we have 


Your = Ye - 0.5 TE 2) E 0) 


Whence 


TORORO] 





Exercises | 165 
_ [05\ _ 21ү(05ү (1 
ње (1°) о 172) (18) - 6) 
_ [0.5 — 05 2 -1ү, (05-05 ү (0 
СО 7 125-2 1 — 0.25 0.75) ` 
Let us estimate the precision of the approximate solutions. Since 


=() 


is the true solution of the system, then 
1201 = IYo- XI = УР = 1, 








17\1 = IY ХІ = V0.5) = 0.5, 
1251 = ЬУ» — ХІ = N(025? = 0.25. 


Whence we infer that the approximate solution 


_ [0 
n (25) 


obtained after the second iteration meets the accuracy required. > 


Exercises 


1. Multiply the matrix A by the matrix B: 


10 _ f0 0. _ [00 | {10 
@ A= (16). B= ({ » wa- (i 2): B= (9): 
a 


1 


c 
2. Multiply the matrix ( 1 7) by itself. 
001 


3. Compute the product AB and BA for А = (2 -3 0) and 


-0 


1111 
1 : 1212 
4. Reduce the matrices to schematic forms: (a) 1131 F 
1214 
12 3 4 
3.4 5 6 
Ol 5 6 78 
31 23 55 42 
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0123 
: 1012 
5. Compute the values of the determinants: (a) 21011! 
3210 
123 4 1 234 
3 6 8 I 3.4 5 6 
(0) 17 52025: © |5 6 7 9 
31 23 55 42 31 23 55 42 
5 4 1 -3 -1 
6. Invert the matrices: (a) ‚ (6) {-2 7 2], 
-8 6 3 E 
0 1 11 
-1 0 11 
Or 01 
-1-1-10 
123 147 0 
7. Find the ranks of: (а) ; (b> (258 Il |; 
1 2 369 12 
10 И 12 


1 
(c) à 
1 


Xy + Х2— Хз = 1, 
8. Compute the solutions of the linear systems: (a) Xi — X2 + 23 = 


| 
_ 
2 


—XxitX2—X3- l; 


7x1— 5x2— 2x3 — 4х = 8 
х= жю + 223 = 2 , 
: -3x + 2x2 + x + = -3, 
o farat 5--L (0 4 2х2 хь- Gz i 
2n - x»- »--2; —Хх\ + хз + 24 = l; 
-3x + X X= 0, 
(d) 5x1 + X2—233- 2, 
—-2x|— 2X2 + x = —3. 
3x, + 20 + x3 = 0, 
9. Compute the solutions of the systems: (a) 4 7x1 + 6x2 + 5x3 = 0, 
5x1 + 4x2 + 3x3 = 0; 


— Exercises |. ——— 167 


-Х1 + хз + 2x4 = 0, хі + 4x2 + 223 - 3x = 0, 
(b) — Xt хз + 2x4 = 0, (с) ) 2x1 + 9x2 + 5x3 + 2x4 + x5 = 0, 
Tx — 5x2 — 2x3 — 4x4 = 0, Xi + 3x2 + X3 — 2x4 — 9х5 = 0. 


4x, — 3x2 — хз — 2x4 = 0; 


Answers 
1 2a ab + 2c 
1. 0) (°°); wm (°°). 2 [от 2 
00 10 00 1 
8 -120 
3. АВ = (-D, ВА = [ 6 -90 
2 -30 
1111 1 2 3 4 
0101 0 -2 -4 -6 
4af 0020 [5 ОТ о о 40 35 
0002 о о о о 


1 (64 32 14 -1 
5. (a) —12; (b) 5; (c) 80. 6. (a) 40 J (b) 2 1 Of; 


25 -1 
0-1 1- 
1 0-1 
Of -1 1 0-1 
1-1 1 0 


7. (а) 2; (5) 2; (03; 8. (а) х = 1, x2 = 1, 3 = 1; б) m= -l, ж» = -L хз = 1; 
(с) xi = —1— уз + 2y4, Xo = —3 + y3 + 2y4, Хз = Y3, X4 = ya; (d) the system is inconsis- 
tent. 9. (a) х = хз, х = —2xs; (b) xi = x) = x + 2:4; or 


or = А + д 


& 
ое 
-ONN 


2 8 
1 2 
(c) Xs = № 1 + р 0 
0 1 
0 0 


Chapter 6 


Linear Spaces and Linear Operators 


6.1. The Concept of Linear Space 


A set V of mathematical objects x, y, Z, ..., called the elements of 

V, is said to form a linear real or complex space if (1) given two arbitrary 
elements x and y in V there exists an element x + y in V called the sum 
of x and y, and (2) given an arbitrary element x in V and an arbitrary 
real or complex о there exists an element ox in V called the multiple of 
X by o, and the induced operations of addition and multiplication by a 
scalar satisfy the following eight conditions 

(а) (x - y) * z 2 x + (у + 2); 

(b)x+y=y+x; 

(c) there exists an element @ in V such that for any x in V there holds 


х-0-х, 
(d) for any x in V there exists ап element (—x) in V such that 
х-(-х)-0, 


(e) a(x + y) = ox + ay; 

(f) (œ + Вх = ox + Bx; 

(в) а(0х) = (a8)x; 

(h) Lx = x. 

The elements x, y, z, ... of the linear space V are often called the vectors, 
the element 6 is called the zero vector and the element (— x) is called the 
additive inverse of the vector x. The linear space is also called the vector 
space. 

Examples of linear spaces. The following sets of mathematical objects 
form linear spaces: 

(1) The set V3 of geometric free vectors with the induced operations 
of vector addition and multiplication of a vector by a scalar described in 
Chap. 2 (Fig. 6.1) as well as the set V, of vectors іп a line and the set 
V2 of vectors in a plane. 

(2) The set of sequences (£!, £, ..., £”) of n real numbers. The operations 
of addition and multiplication by a real number are induced as 

(a) Addition 


(, Р, ы £") + a, "T, heat d 1") 


= (Еа nl opo Еп L „тү 
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(b) Multiplication by a real number 
МЕ, £, .., E) = ОЕ, АР, ...‚ MP). 


In symbols R” designates the n-dimensional real coordinate space 
(Fig. 6.2). 


Aa(A«0) 





(a) (9 
Fig. 6.1 





(AE, АЕ) 
(A>0) 





(e) 


Fig. 6.2 


(3) The set R”*" of m x n matrices, discussed in Chap. 5, with the 
operations of matrix addition 


ајр 0122 0. бал Bun Вә .. би 
ај О22 2. Ойл " Ba B22 .. Ban 
Om] Om2 Omn Bm Bm2 Bmn 


оп + Ві ор + Biz... ain + Bin 
azı + B21 022 + 2 2. On + бол 


Omi + Ümi Qm + Bm2 .. атп + Втп 
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and multiplication of a matrix by a real number 


а O12  .. Qin hai Ае; .. Ао 
\[ 92: 002 .. Ол 2; Моор Ае; .. Хал 
Om Om2 Omn AQm1 №т2 AOmn 


In particular, the set Ї х" of row-vectors, ie., 1 х п matrices, and the 
set R”*! of column-vectors, ie, т x 1 matrices, form linear spaces. 

(4) The set C(0, 1) of real functions continuous on the interval (0, 1) 
with the usual operations of addition of functions and multiplication of 
a function by a real number. 

In all these cases the validity of the conditions (a)-(h) are directly 
verified. 

Properties of linear spaces. (a) The zero vector 6 is uniquely defined. 
"4 Let 0; and 0› be zero vectors іп V and let 0; + 0; be a sum of 6, and 
0». Since 62 is the zero vector the condition (c) implies that 81 + 0; = 61. 
The vector 6; is also zero. Then б, + 06; = 62 + 0, = 02. > 

(b) For any vector x the additive inverse vector (— x) of x is uniquely 
defined. 
"* Let x” and x. be two additive vectors inverse to x. Let us show that 
X and x. are equal. 

Consider the sum x- + x + x .Sincex is inverse to the vector x the 
condition (a) implies that 

Х-+Х+Х -х +(х+х ) 
Analogously we have 


X- -0- x.. 


х-+Х+Х =(x_-+x)+x -0-4Х =x. > 


It is easy to verify the following properties: 

(c) for any vector x there holds Ox = 6; 

(d) for any vector x there holds —x = (—1)x; 

(e) for any real number a there holds af = 6; 

(f) the identity ax = @ implies that either а = 0 or x = 6. 


6.2 Linear Subspaces 


A non-empty subset W of a linear space V is said to form a /inear 
subspace in V if given arbitrary vectors x and y in W and any number 
a, the following conditions are satisfied: 

(i) x + y is an element of W and (ii) ax is an element of W. The set 
W is said to be closed with respect to the operations (i) and (ii). 


Examples of linear subspaces. (1) The set V2 of vectors in a plane forms 
a subspace of the linear space V3. 
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(2) The solution of the homogeneous system of m linear equations in 
n unknowns 


a1 O12 we) Ойл Xi 0 
ол az бэл x» \_[ 0 
Omi Om2 Omn Xn 0 


or, in matrix form, 
AX =0 (*) 

forms a linear subspace. 
ча In Chap. 5 we have shown that the sum of the solutions of (*) and 
the multiple of the solutions of (+) by a number are also the solutions of 
(ж). Whence we infer that the set of the solutions of (ж) forms a linear 
subspace of the linear space R^*!, > 

(3) The set of all functions f(x) such that each f(x) is equal to zero 
at x = 0, ie, f(0) = 0, forms a subspace in the linear space C(— 1, 1) 
of real-valued functions continuous on the interval (-1, 1). 
“4 It is easy to verify that the set 


Co( - 1, 1) = (/00//0) = 0] 


is a linear subspace since both the sum f(x) + g(x) and the multiple af(x) 
are equal to zero at x = 0 when f(0) = g(0) = 0. » 

From the definition of linear subspace it follows that if x1, x2, ..., Xg 
are vectors in W so is any linear combination ах; + o2X2 + ...  agXq. 
Also, for any linear subspace W the conditions (a)-(h) are satisfied. This 
means that any linear subspace of linear space is itself a linear space. It 
suffices to show that both the zero vector and the additive vector inverse 
to an arbitrary vector in W belong to W. 

*4 Let x be an arbitrary vector іп W. Then multiplying х by 0 and - 1, 
we get the zero vector 0 = Ox and the additive vector —x = (—1)x inverse 
to x. Obviously, the other conditions are also satisfied. > 

Sums and intersections of linear subspaces. Let V be a linear space and 
let W, and УУ: be linear subspaces in V. 

The sum W, + W2 of the linear subspaces Wi and W2 is the set of 
all vectors x in V such that 


х= х +X, (6.1) 


where x; is a vector іп W; and x2 is а vector іп W2. 
In symbols, we write the sum of УУ, and W2 as 


Wi; + №, = {х= х + X2|x1 € Wi, Хэ € W2). 


We show that the sum W, + УУ: is a linear subspace in V. 
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ча Let x and y be two arbitrary vectors іп УУ: + W2. By definition of 
the sum of linear spaces there exist xy and y; in Wi, and x2 and y» in 
W^» such that 

х= х + х and y=yi+ у. 


Whence the sum х + y may be written as 
X t y = (х + X) + (у + уз) = (Ху + уз) + (хә + y2). 


Since x; + yı belongs to W; and x? + у» belongs to W2 the sum x + y 
is a vector in W; + W2. 

Similarly, we can prove that ox is a vector in W; + W;. > 

The sum of the linear spaces W; and УУ: is said to be direct if the expan- 
sion (6.1) is unique for any vector x in №; + УУ: (Fig. 6.3). 





Fig. 6.3 


In symbols, we write W; @ УУ: to designate the direct sum of W, and 
W2. 

The intersection № ПҸ of the linear subspaces №; апа W2 of the 
linear space V is the set of all vectors x that belong both to W; and to W2. 


Properties of sums and intersections of linear subspaces. (a) The inter- 
section №; ПҸ: is non-empty since №; П Ұ: always contains the zero 


vector. 
(b) The intersection W; N УУ: is a linear subspace of the linear space V. 
(c) If the zero vector is the only vector contained both in W; and in 
W2, the sum of W; and W2 is direct (denoted by УУ: Ф W2). 


Linear span. Let V be a linear space and X be a subset of V. The set 


q 
L(X) = р = >) axj|xjeX; oj; 4-1, 2, «] 


1-1 
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of all linear combinations of vectros y in X is called the linear span L(X) 
of X. In other words, the linear span L(X) of X is the set of all vectors 
y that may be represented as linear combinations of vectors in X. 


Properties of linear spans. (a) The linear span L(X) contains the set X. 

(b) The linear span L(X) forms a linear subspace in V. 
ча This follows from the fact that the sum of linear combinations of vec- 
tors in X and the multiples of linear combinations by an arbitrary number 
are linear combinations of vectors in X. » 

(c) The linear span L(X) is the smallest linear subspace containing the 
set X. 

In other words, if the linear subspace W contains the set X, then W 
also contains the linear span L(X) of X. 
а Indeed, an arbitrary linear combination oix; + о + ... + agXq of 
vectors in X, being an element of L(X), is also contained in W. > 





Fig. 6.4 


Examples. (1) Let £ = (1, 1, 0) and з = (1, 0, 1) be two vectors in the 
linear space IR?. Then the set of solutions of the equation 


Е! + Е m £ = 0 (6.2) 


is the linear span L(£, 1) of the vectors £ and 3. 
ча Indeed, the triples (1, 1, 0) and (1, 0, 1) form the fundamental system 
of solutions of the homogeneous equation (6.2). Hence each solution of 
(6.2) is a linear combination of fundamental solutions (Fig. 6.4). > 

(2) Let C( — œ, оо) be a linear space of real-valued functions continuous 
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at every point of the number line and let 
X = (1, x, .., х") 


be a set of monomials 1, x, ..., x". 

Then the set of polynomials of degree not exceeding п with real coeffi- 
cients forms the linear span L(X) of X. 

In symbols, L(l, x, ..., x") designates the linear span of the set of 
monomials 1, x, ..., x" and M, stands for the set of polynomials of degree 
not exceeding ” with real coefficients. 


6.3  Linearly Dependent Vectors 


Definition. The vectors Ху, Хо, ..., X4 in the linear space V are called 
linearly dependent if there exist numbers ол, a2, ..., ош, not all zero, such 
that 


ах + O2X2 + ... + gX = 0. (6.3) 


If the equality (6.3) is satisfied by о; = az = ... = ag = 0 only, the vec- 
tors xi, X2, ...‚ Ха are called linearly independent. 

The following propositions are valid. 

Theorem 6.1. The vectors x1, Xo, ..., Ха (4 2 2) are linearly dependent 
if and only if at least one of these vectors may be represented as a linear 
combination of the others. 
<~ Assume that the vectors Ху, X2, .., Хү are linearly dependent. For 
definiteness we set a, ~ 0 in (6.3). Transposing to the right all the terms 
in (6.3) except the qth one and dividing all the terms by a, ~ 0, we have 

ха = D AME i „= @#® у AT MC. ү. 
94 Qq 9а 
ie, the vector x; is a linear combination of the vectors x1, Хо, ..., X4. 

Conversely, if one of the vectors is equal to the linear combination of 

the others 


B1X1 + Вж +... Bg 1X4 1 = Ха 
then transposing xg to the left, we have the linear combination 
B1X1 + В + ... + бе-1Хд-1 + (—1)Xq = 0 


that contains at least one nonzero coefficient (—1 = 0). This means that 
the vectors X1, Хо, ..., ха are linearly dependent. > 

Theorem 6.2. Let xi, x», ..., ха be linearly independent vectors and let 
y = aX) + 0202 +... + QgXqg. Then the coefficients a1, a2, .ч Од are 
uniquely defined by y. 
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я Let 
у = fy + oXo +... + Ваха. 
Then 
1X1 + O2X2 +... t AgXg = Вх + Вж + ... + Ваха. 
Whence 
(ол — B1)K1 (a2 — 82)Xo + ... + (oq — Ва)ха = 0. 
Since the vectors xij, Xo, .., ха are linearly independent, 
өл — бз = az – b2 =... = ад — By = 0. Hence, ar = Bi, а = fz, s 
«а = Ва. P 
Theorem 6.3. A collection of vectors containing linearly dependent vec- 
tors is linearly dependent. 


Fig. 6.5 


* Let Xi, Xo, ..., Xg, Хе +1, --) Xm be a collection of vectors such that the 
first q vectors are linearly dependent. Then 


О1Х:) + 022 +... + agXq = 6 


and not all o, a2, ..., o; are equal to zero. Adding to this linear combina- 
tion the multiples of x; + 1, Xg +2, ..., Xm by zero, we obtain the linear combi- 
nation 


ох + O2X2 +... + QgXg + Оха +1 +... + OXm = 6, 
where not all a; are equal to zero. > 


Example. The vectors in V3 are linearly dependent if and only if they 
are coplanar (Fig. 6.5). 


6.4 Basis and Dimension 


The sequence of vectors е, e», ..., e, in a linear space V is called 
the basis of V if ei, ei, ..., e, are linearly independent and any vector in 
V may be expressed as a linear combination of ёл, ei, ..., ёл. 
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We speak of the sequence of e1, ёо, ..., ёл to mean an ordered n-tuple 
of vectors. It is easy to see that by permutation of n vectors we obtain 
n! ordered sequences (collections) of n vectors. 

Example. Let a, b, c be three noncoplanar vectors in V3 (Fig. 6.6). Then 
the ordered triples (a, b, c), (b, c, a), (c, a, b), (b, a, c), (a, c, b) and 
(с, b, a) form distinct bases in V3. 

Let e = (ei, e, ..., ёл) be a basis in V. Then given an arbitrary vector 
x in V, there exists a collection of n numbers £!, £, ..., £" such that 


n 
x = fle, + Fer +... + Ee = У) Ee 
1-1 


Fig. 6.6 


By Theorem 6.2 the numbers £!, #2, ..., Ё", called the coordinates of 
the vector x relative to the basis e, are üniquely defined. 
Let x = Y tle; and y = Y nie; be vectors in V. Then 


i=l і= 1 


х+у= У tei + Y wee Ў, (Е + йе 


1-1 і= 1 i=l 
and, for any number o, 


n n 
ох-а > t'ei- (аа )ё. 
1-1 i=l 
Thus, addition of vectors becomes addition of their respective coordinates 
and multiplication of a vector by a scalar becomes multiplication of the 
coordinates by this scalar. 
The coordinates of a vector may conveniently be written in the form 
of a column-vector. For example, 
£e 
g 


р 
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n 
is the column-vector of the coordinates of the vector x = 2, £'e; relative 
(1 
to the basis e. 
Let 
: n . n А 
пе, X2= Уу Бе, ., ха = 2, &ei 


i=l i=1 i=l 


Ma 


ху = 


be expansions of the vectors x1, Хо, ..., Хү relative to the basis e and let 


Н 13) & 
ți i [& 
g Bg g 


be column-vectors of the coordinates of Ху, ..., Хо, ..., Xg relative to e. 

Theorem 6.4. The vectors хі, Хо, ..., ха are linearly dependent if and 
only if so are the column-vectors of their coordinates relative to some basis 
e. 
за Assume that the linear combination АХ: + № + ... + AgXq of the vec- 
1015 Xj, Хо, .., Ха is equal to the zero vector, i.e., 


q 
У) № = 
к-1 
or, in explicit form, 
q n | п q . 
21 «( > Че) 2 ( 2, M) е; = б. (6.4) 
к-1 “1 і=1 \k=1 


Recall that the expansion of a vector relative to a given basis is unique. 
Then from (6.4) we have 


а а 4 
DME = 0, У № -0,., >, tk 50 
ко K-1 k=1 


or, equivalently, 


Bü B Я 0 
м| 8 Jeu» 8 jea 1-0 (6.5) 
н E E 0 


This means that the linear combination of column-vectors of coor- 
dinates of xi, Хо, ..., Хү is equal to the zero column-vector. 

If we suppose that (6.5) holds, then reversing our arguments, we arrive 
at formula (6.4). Whence we infer that if a nontrivial linear combination 
Of X1, Хо, ..., Ха (ie. a linear combination with Ху, №, ..., № not all equal 
to zero) vanishes, then a nontrivial combination of column-vectors of the 


12—9505 
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coordinates of Ху, Хо, ..., Xz with the same numbers X, №, ..., Ag is equal 
to the zero column-vector and vice versa. > 

Theorem 6.5. Let a basis in V comprise n vectors. Then any system 
of m (m » n) vectors in V is linearly dependent. 
ча By Theorem 6.3 it suffices to consider the case of m = n + 1. 

Let Ху, Хо, ..., Xn «1 be arbitrary vectors in V. Expanding these vectors 
relative to the basis e = (ei, ez, ..., ел), we have 


xi = е + бе +... + 6 
х = е +e. +... + Hen, 


Xni = Ёле + Ёё +... + ERG en. 


Arranging the coordinates of Ху, Хо, ..., Хл+1 in matrix form, we obtain 
the n x (n + 1) matrix 


E oH os Bua 


EB Bua 


where the jth column comprises the coordinates of x; (j = 1, 2, ..., n + 1). 

Since the rank of K does not exceed the number 7 of rows, (n + 1) 
columns in K are linearly dependent column-vectors being at the same time 
the column-vectors of the coordinates of xi, Хо, ..., X1+1. Then by Theo- 
rem 6.4 the linear dependence of these column-vectors implies that the vec- 
tors X1, X2, ..., Xn 41 are linearly dependent. > 

Corollary. All bases in V comprise equal numbers of vectors. 
*4 Let e be a basis comprising п vectors and let e^ be a basis comprising 
n' vectors. By Theorem 6.5 the linear independence of ei , e2, ..., e; implies 
that n’ « n. On the other hand, by Theorem 6.5 the linear independence 
of ёл, ёо, .., e, implies that n < п’ Thus we have n = n’. * 

The dimension of a linear space V is the number of vectors contained 
in the basis of V. : 

Examples. (1) The basis of the real coordinate space IR" is formed by 
the vectors 


€i = (1, 0, ..., 0, 0), е = (0, 1, ..., 0, 0), ..., en = (0, 0, ..., 0, 1). 
+4 Indeed, the vectors ei, ёо, ..., e, are linearly independent since the 
identity 

aye) + a2e2 +... + Ane, = 0 
implies that 
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о1(1, 0, ..., 0, 0) + a2(0, 1, ..., 0, 0) 
+... tan(0, 0, ..., 0, 1) = (ai, oo, ..., Œn) = (0, 0, ..., 0). 
Whence о = a2 = ... = аһ = 0. 
Besides, any vector Ё = (£!, £%, ..., £") іп R” may be expressed as a linear 
combination of ej, ёо, ..., ёл, ie, 
E = £1, 0, ..., 0, 0) + 20, 1, ..., 0, 0) 
+... + #"(0, 0, ..., 0, 1) = (El, 2, .., E”). 
This means that the dimension of R” is equal to n. > 
(2) Recall that the homogeneous linear system 
апд + Q12X2 +... + оллХл = 0, 
O21X1 + 0222 +... + ОолХ = 0, 


peooooveeveeeceeeeceosesssecooooeevosevecvone 


OmiXi + Qm2X2 +... + OmnXn = ©, 


with nontrivial (nonzero) solutions possesses the fundamental system of 
solutions which forms a linear space. The dimension of this linear space 
of solutions of a homogeneous linear system is equal to the number of 
the fundamental solutions, i.e., to n — r, where r is the rank of the coeffi- 
cient matrix and n is the number of the unknowns. 

(3) The dimension of the linear space М, of polynomials of degree not 
exceeding n is equal to n + 1. 
*« Since any polynomial of a degree not exceeding n takes the form 


P(t) = ao + oit + ... + ant”, 


it suffices to show that the vectors е = 1, ez = £, ..., еп+1 = Ѓ" are linearly 
independent. 
Consider the identity 


aol + aif +... + ant” = 0, (6.6) 


where 1 is an arbitrary quantity. Setting t = 0, we have ao = 0. 
On differentiating (6.6) with respect (0 4, we have 


ол + 2057 + ... + попі"! = 0. 


Whence, setting £ = 0, we obtain о; = 0. 
Repeating this process, we arrive at the identity 


Oo = a1 = ... = аһ = 0. 


This means that the vectors e1 = 1, ez = f, .., €n41 = (^ are linearly 
independent. Hence, the dimension of М, is equal to n + 1. >» 


I 
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A linear space of dimension n is called the n-dimensional linear space. 
In symbols, we write dim V = n. 
By way of convention the dimension of the linear space V is considered 
equal to л everywhere through this chapter unless otherwise mentioned. 
It is clear that for any linear subspace W in V dim W x n. 
We show that the linear space V comprises linear subspaces of any 
dimension k < n. 
ча Let e = (ei, e, ..., еп) be a basis in V. It is easy to verify that the linear 
span 


Wi = L(ei, Ёл, хээ ex) 


is a linear subspace of dimension k. > 

By definition dim (0| = 0. 

Theorem 6.6. Let aj, a2, ..., ax be linearly independent vectors in a linear 
space V and k < п. Then there exist vectors ak +1, 8k «2, ..., Эл in V such 
that the vectors ai, a2, ..., ак, ак+1, ..., ап form a basis in V. 

* Let b бе an arbitrary vector in V. If the vectors ац, a2, ..., ак, b are 
linearly dependent then 


b = aya; + аа + ... + акак (6.7) 


because the linear independence of aj, az, ..., ax implies that the nontrivial 
linear combination 


Ма + Хаг +... + Акак + ub -0 


involves p = 0. 

Suppose that each vector b in V admits a representation of the form 
(6.7). In this case the vectors аг, a2, ..., ах form a basis in V by definition. 
However, this contradicts to the fact that the number k of the vectors is 
smaller than the dimension л of V. This means that there must exist a vector 
ак +1 in V such that the vectors a1, 32, ..., ак, ак + ı are linearly independent. 

If k + 1 = n the vectors ai, az, ..., ак, ak+1 form a basis in V. 

If k+ 1 « n we repeat the previous reasoning for the vectors a1, a2, 
cory dk, Ak +1- 

This process enables us to complement the collection of the vectors ai, 
аз, ..., ак by the vectors ак +1, ак +2, ..., ап SO that these n vectors form 
a basis in V. In other words, we may always construct a basis of V which 
incorporates the basis of a given subspace of V. > 

Example. Construct the basis of IR* by complementing the collection 
of the vectors a; = (1, 2, 0, 1) and a? = (- 1, 1, 1, 0). 
<4 We choose the vectors аз = (1, 0, 0, 0) and a4 = (0, 1, 0, 0) in IR^. Let 
us show that the vectors аг, 32, аз, a4 form a basis of Р“. 
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Consider the matrix 


1 
0 
0 
0 


о о ~ о 


1 2 
-1 1 
1 0 
0 1 
whose rows comprise the coordinates of a), ал, аз, a4. 
Notice that the rank of A is equal to 4. This means that the rows of 
A and, consequently, the vectors аг, a2, аз, a4 are linearly independent. 
Thus they form a basis of Р“. » 
We proceed similarly in the general case. 


Suppose we wish to construct the basis of IR" by complementing the 
collection of & linearly independent vectors 


ат = (011, O12, 2, Qin), 22 = (021, O22, ..., Оо), 
es Ak = (aki, Ok2, ч, On). 


By elementary row operations the matrix 


O1 012 .. Qin 
21 22 .. O2n 
Okt ак2 Okn 


can be reduced to the schematic form and then complemented by n — k . 
rows of the form 


(0, 0, ..., 1, 0, ..., 0) 


so that the rank of the matrix obtained becomes equal to n. 
Theorem 6.7. Let W; and W2 be linear subspaces in V. Then 


dim (Wi + W2) + dim (WiNW2) = dim W; + dim W;. 


6.5 Changing a Basis 


Let e = (e1, e, ..., ел) and e’ = (ег, ez, ..., ёл) be bases of V. Ex- 
panding the vectors in e^ with respect to the basis e, we have 


n 
ef = Mole, /=1,2,.‚п (6.8) 
i=l 
or 
ej = olei + ое» +. tajen, j=l, 2, ..., п. 
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In matrix form, (6.8) becomes 


at а} о} 
(eí, ei, ..., ex) = (е, е, ..., €n) орой ой . (6.9) 
ар оз On 
The matrix 
al ol al 
A= oi од од, 
ої о? On 


describes the transition from the basis e to the basis e’ and is called the 
transition matrix. 


Properties of transition matrices. (a) The determinant of A is not equal 
to zero, ie,det А z 0. 

Let us assume the converse, i.e., det A = 0. This means that the columns 
in A are linearly dependent. 
ча Since the column-vectors in A are the column-vectors of the coor- 
dinates of ef, ez, ..., e; relative to the basis e, Theorem 6.4 implies that 
ег, ei, ..., e; are linearly dependent vectors. However, this contradicts to 
the fact that e' is the basis of V. Consequently, the assumption that 
det A = 0 is false. Hence, det А # 0. » 

(b) If £', £, ..., E" and E'!, E'?, ..., £'" are the coordinates of a vector 
x relative to the bases e and e’, respectively, then 


Е Е 
ХАГ 01 (6.10) 
p p 


ча Substituting (6.8) into the formula 
n n 
x= у te - >) £"ej 
і=1 ј=1 
we obtain 
n й n . n " n n . К 
2; te >; e( 2 ales) = У) ( Z ale”) е. 
і=1 j=1 i=l i=l j=1 
Since, relative to a given basis, the expansion of a vector is unique, the 
above relation gives 


n 


f= oit" i-12,..,n. 


j=l 
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Arranging these expressions in matrix form, we easily obtain (6.10). > 
(c) The inverse A ^! of A is the transition matrix from the basis e’ to 
the basis e. 
ча To verify this property it suffices to post-multiply (6.9) by АГ.» 


6.6 Euclidean Spaces 


A real linear space V is called a real Euclidean space if, given two 
arbitrary vectors x and y in V there exists a number (х, y) (or x-y) such 
that for any vectors x, y and z in V and any arbitrary number o the follow- 
ing conditions are satisfied: 

(a) (x, y) = (y, x); 

(b) (x + y, 2) = (X, 2) + (у, 2); 

(с) (ох, у) = о(х, у); 

(d) (x, x) > 0; the identity holds if and only if x = 6. 

The number (x, y) is called the scalar product of vectors x and y. 

Examples of Euclidean spaces. (1) The space V3 of free vectors with 
the scalar product of a and b defined as 


(а, b) = [a||b] cos (a,b). 


(2) The real coordinate space IR" with the scalar product of two vectors 
E= (El, £, .., E") and q = (nl, 77, .., n”) in R” given by 


D= Nw 


(3) A linear subspace of the Euclidean space. 
In all these cases the validity of conditions (a)-(d) are directly verified. 
Employing the definition of the Euclidean space we can easily prove 
the following properties: 
(1) (0, x) = 6; 
(2) (x, ay) = ax, y); 
(3) (x, у + z) = (х, у) + (х, 2); 
т т 
(4) ( У ом, Ў) в) = у 2 aibi, yj). 
i=l j=1 1-1 j= 


Theorem 6.8. For any two vectors x and y of the Euclidean space there 
holds the inequality 


(х, у)? < (х, х)(у, у) 


(the Cauchy-Schwarz inequality). 
ча If (x, x) = 0, then x = @ and the inequality holds since (0, y) = 0. 
Let (x, x) # 0. Then (x, x) > 0. 
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By the definition of the scalar product the inequality 
(tx — у, tx — у) 20 (6.11) 


holds for any x and y in V and for any real number /. 
The inequality (6.11) may be written as 


P(x, x) - 2tx, у) + (у, у) 20 
The left side of this inequality can be considered as a quadratic trinomi- 
al with respect to £. Since the sign of the trinomial remains unchanged for 
any | we infer that the discriminant of the trinomial is nonpositive, i.e., 


(x, y? — (x, х)(у, у) «0 


Whence we have the desired inequality. 9» 
Definition. A length of an arbitrary vector x is a number |x| given by 
x = V(x, x). 


It is easily seen that |x| > 0 for any x, and |x| = 0 only if x = 6. 





Fig. 6.7 





Consider the following identities 

k+y?=&+y, х + у) 

= (x, х) + 2%, у) + (у, у) = |х| + 2(х, у) + |у. (612) 
Replacing 2(х, y) by 2|(x, y)| and applying the Cauchy-Schwarz inequali- 
ty |(x, y)| < |x||y, we have 

Ix + у < Ix + 21х[у| + ly? = Cx] |у]. 
Extracting the square root, we arrive at the triangle inequality (Fig. 6.7) 

Ix + y| < |x| + Iyl 


The angle between the nonzero vectors x and y is the number ¢ such 
that 0 < o € v and 


(x, y) 


Ix] yl ` 





cos e = 
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By Theorem 6.8 it follows that 


Ix| p]. ^ 
for any nonzero vectors x and y, ie, the range of cos р completely cor- 
responds to its domain. In other words, the angle between any two vectors 
is correctly defined. 
The vectors x and y are called orthogonal if (x, y) = 0. 
From (6.12) it follows that, given orthogonal vectors, there holds the 
identity 





Ix + y? = |x)? + |у 
which may be thought as generalization of the theorem of Pythagoras, 
namely, the square of the lengths of two orthogonal vectors is equal to 
the sum of the squares of the lengths of these vectors (Fig. 6.8). 
The collection of vectors f;, Їл, ..., fx is called orthogonal if (fi, fj) = 0 
for i Æj, and orthonormal if 
еу § 1 fori j, 
(fi, у= (o for i x j. 
Theorem 6.9. Any orthonormal collection of vectors fı, f2, ..., Ёк is 
linearly independent. 
ч. Computing the scalar products of 


об + 025 +... + акк = 0 
by f; G = 1, 2, ..., k), we have k identities 
aj, 65) - 0 —1, 2, ..., k). 
Since (fj, Е) = 1 then a; = 0 for every j G = 1, 2, ..., К). > 
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Let Ї, f2, ..., Ёк be linearly independent vectors in a real Euclidean 
space. We describe the procedure of constructing a collection of К or- 
thogonal vectors using f1, f2, ..., f; (Fig. 6.9). 

«4 Set gi = fı. For the vector g2 = f; — aig: to be orthogonal to g it is 
necessary that the identity 


0 = (f2, 81) — ад, 81) 
holds. 
Whence 
dic (f2, 81) 
1 (gı, g1) 1 
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Thus the vector 


(f2, g1) 
(gi, ёл) P 


is orthogonal to gi. 


g2 =f – 











8,-1, 


(a) 


Fig. 6.9 





Using gi, g2, f3, we construct the vector рз = Ёз — 1р1 — 2р2 which 
is orthogonal both to g; and to gz. It suffices to require that the numbers 
В and бэ» satisfy the following conditions 


0 = (fs, gi) - (р, gı) and 0 = (fs, g2) — B2(g2, g2). 
Whence 
_ (fs, в) aid scc (f3, g2) 





Br = е, в) (82, рә) ' 
so that the vector 
(fs, £1) (fs, £2) 
LE = 
бэ (m ED) (во, во) ^ 


is orthogonal both to р; and to g2. 
Similarly, the vector 


р 0ьв) (fg) | _ (.g-) , 
Gaye бүс ул 77 e er С. 
(i = 3, 4, ..., K) 


is orthogonal to each of the vectors gi, Ёс, ..., gi- 1- 

Therefore the vectors gi, g2, ..., gx form a collection of orthogonal 
vectors. 

Dividing each vector g; (i = 1, 2, ..., К) by its length |g;| we obtain the 
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collection of the orthonormal vectors (see Fig. 6.10) 


grec Bh g M ON 
lg: [gl 


B 
gl 

The basis e = (ei, ёо, ..., ёл) of the Euclidean space is called orthonor- 
mal if 


Lag x uh uum e 
(ei, e) = bu = fo гж) (, j 1, 2, uL n). 


92 
е 
е, 
= = 
е, 8, 
Fig. 6.10 


Summing up, we arrive at the following theorem. 

Theorem 6.10. There exists an orthonormal basis in any Euclidean 
Space. 

Example. Let a; = (1, 2, 0, 1), a2 = (— 1, 1, 1, 0) and аз = (2, 2, 0, 1) 
form a basis in a Euclidean space. 
ча We apply the procedure of orthogonalization to construct the orthonor- 
mal basis in this space. 

Set b, = a; and b; = a; — аа;. For the vector b; to be orthogonal to 
the vector a; it is necessary that the identity 

0 = (22, a1) — a(ai, ai) 


is fulfilled. 
Whence 


= (a2, a1) Е -142 21 
(8,а: 1-441 6’ 





so that 


b= (-b 10-5020 )=(-%.5.4 - i) 
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Evidently, the vector c; = 6b; = (—7, 4, 6, —1) is orthogonal to the 
vector a,. Indeed, 


(со, а) -(-7 х1-4хХ2-46х0-(-1х1-0. 


For vector Ъз = аз — Bai — ус to be orthogonal both to a; and to c 
it is necessary that the identities 


0 = (аз, a1) — (ау, ai) and 0 = (аз, с) — (со, ez) 


are fulfilled. 





Whence 
В = (a3, a1) c 2+4+1 _7 
(a1, a1) 6 6 
and 
_ (аз, е) — -1448-1 _ 7 
Y= (2,c) 49416-3641 102 
so that 


b; = (2, 2, 0, 0-10, 2, 0, 1) + uc 7, 4, 6, -1) 


_ ( 204 — 119 — 49 204 - 238 + 28 42 102 – 119 — 7 
Е 102 ; 102 ' 102" 102 


[бз у>. 2228 
“Хү” 177 17” 17 J’ 
It is easy to see that the vector єз = 176; = (6, —1, 7, —4) is orthogonal 
both to a; and to Со. 
The vectors аџ, бо, єз form a collection of orthogonal vectors. Dividing 


ау, C2, сз by their lengths Уб, V102, 4102, respectively, we arrive at the 
collection of the orthonormal vectors 


ёр 2 ал арлр 











46 
1 
= (-7, 4, 6, 1), 
® X410 
1 
e = 6, —1, 7, 4). 
3 TT ( ) 
The vectors ei, ёо, ез form an orthonormal basis of the Euclidean 
space. 9» 


If we wish to compute the scalar product of two vectors in a Euclidean 
space it is helpful to expand the vectors relative to the orthonormal basis 
because in this case the desired product can be expressed by the simplest 
formula. 
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< [et e = (ei, e, ..., ел) be an orthonormal basis of a Euclidean space. 
Computing the scalar product of the vectors 


x= >) £e and y= we 
i=l j21 
we have 


(x, y) = С Pen 2 ve) T 


D вте, e) = У Ew. 


Ms 
Ms 


1 


i 


In particular 
«x= Gy. 
i=1 


Whence 


m= | X EP > 


6.8 Orthocompliments of Linear Subspaces 


Ms 


Suppose that we are given a linear subspace W in a Euclidean 


space V. 
A collection of vectors in V that satisfy the condition 
(y, x) = 0, 


where x is an arbitrary vector in W, is said to be an orthocompliment of 
W, denoted by №. In other words, the orthocompliment W- of W con- 
sists of vectors y in V that are orthogonal to all vectors in W. 


Properties of orthocompliments. (а) The orthocompliment W - of the 
linear subspace W is a linear subspace in V. 
“4 Let y; and уз belong to W., ie, 


(yi, ) = 0 and (y2, х) = 0 


for any vector x іп W. 
Adding these expressions, by properties of scalar products, we derive 
that for any vector x in W there holds 


(у: + y x) = 0. 


Whence we conclude that у; + y» belongs to WŁ. 

Since (y, x) = 0 for any vector x in W, we infer that (ay, x) = o(y, x) 
and, consequently, ay is a vector in W1. » 

(D V=WOwW!. 
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ча This follows from the fact that the zero vector is the only vector con- 
tained both in W and in W-. > 
Property (b) implies that any vector x in V is uniquely expressed as 


=y+z, (*) 


where y is a vector іп W and z is a vector in W- (Fig. 6.11). 





Fig. 6.11 


The vector y in W is called the W-component of x and z in W- the 
orthogonal component of x with respect to W. 

Let x be a given vector in a linear subspace W. We show how to construct 
the W-component of x and the orthogonal component of x with respect 
to W. 
ча We may assume that the orthonormal basis ei, ёл, ..., ex is introduced 
in W. Then the W-component of x, denoted by y, is expressed as 


y = оле + а2е2 +... + оке. 
Substituting y into (ж), we have 
X = се, + се + ... + окек + 2, 


where z is the desired orthogonal component: of x with respect to W. 
Recall that z is orthogonal to W. Then, multiplying x successively by 
€1, ёо, ..., ey, we obtain 


(х, е) = ол, (X, е) = a2, ..., (X, ек) = ок. 
The vectors 
у = (x, e1)e; + (х, ег)е + ... + (x, ек)ек 
and 


1-Х-у 
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are the W-component of x and the orthogonal component of x with respect 
to W, correspondingly. » 
Example. Construct the W-component of x — (4, 2, 3, 5) for the linear 
subspace W C Р“, specified by the system of equations 
X + X2 + X3 + м = 0, 
-Xx — Хз +x, = 0. 
«~ The vectors a; = (1, 0, 0, —1) and az = (0, 1, —1, 0) form the fun- 
damental system of solutions and, consequently, the basis of W. To con- 
struct the orthonormal basis of W it suffices to divide ац and a2 by their 
lengths. We obtain 


n зу - E Ned oe 
a= (eo 5) and e» СЕЗ DE 


The vector 


"eR -—2_\{(1 TUN 
250 ад 1 -Ll 
(Gwar ww) 
1 1 1 1 1 
= G 2? 0, 0, 3) + (o 773977? 0) =5(-1, -1, 1, 1) 


is the W-component of x = (4, 2, 3, 5) and the vector 


1 1 1 1 
є=хх-у= (43-23 25.43) 


is the orthogonal component of x with respect (0 W. > 


6.9 Unitary Spaces 
A unitary space is a complex linear space U such that given any 

ordered pair of vectors x and y in U, there exists a number called the scalar 
product (x, y) of x and y so that for any vectors x, y and z in U the following 
conditions are satisfied 

(a) (y, x) = (x, y) (here (x, y) is the complex conjugate of (x, y)); 

(b) (x + y, 2) = (x, 2) + (y, z); 

(c) (ox, y) = a(x, у); 

(d) (x, y) > 0, the identity (x, x) = 0 holding for x = 6 only. 

Example. Consider a coordinate space C" whose elements are all possi- 
ble ordered n-tuples of n complex numbers. The scalar product of the vec- 
tors Ё = (£, £, .., Е") and т= (gl, n°, .., 7”) in C" is expressed as 


a= Xe. 
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6.10 Linear Mappings 


Let V and W be two linear spaces, either both real or both complex. 
A linear mapping of a linear space V into a linear space W is a rule 
or operation .2/ which associates every vector x in V with a unique vector 
у = Ax in W so that 
(i) A(x +X2) = Wx, + «x? and (іі) (ох) = osx. 
The conditions (i) and (ii) can be expressed in a single formula 


A (сах: + оох?) = ai Xi + оо Á X2. 


In symbols, we write .o7: V > W to designate a mapping of V into W. 

To illustrate the concept of linear mapping we shall turn to some ex- 
amples: 

(1) Let V = W = M,, where M, is a space of polynomials of degree 
not exceeding л. Laws of differentiation set every polynomial in М, into 
correspondence with its derivative which is also a polynomial of degree 
not exceeding n, i.e., belongs to M,. Yet, the derivative of the sum of the 
polynomials (functions) is equal to the sum of the derivatives of these poly- 
nomials and the derivative of the polynomial multiplied by a scalar is equal 
to the derivative of the polynomial multiplied by the same scalar. Under 
the operation 2: М, > M, which associates every polynomial in M, with 
its derivative the conditions (i) and (ii) are fulfilled. Hence, the operation 
2: M, > M, is a linear mapping. 


Fig. 6.12 


(2) The rule that sets every vector x in V into correspondence with a 
vector Ах in V, where X is a given scalar, is a linear mapping, called the 
similarity mapping (Fig. 6.12). 

(3) Let e = (ei, ёо, ..., ёл) be a basis of V. The rule 2: V — V, which 
associates every vector : 

x = fe. + Pet... Ee +... + Етел = >> Ee 


i=1 


in V with the vector 


k 
Px = Пеј + Pert... + ee = У) Eli 
1-1 
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in V, given k < n, is a linear mapping called the projective mapping. Hence, 
the linear mapping 2: V — V sets every vector x from V into correspon- 
dence with the vector Zx in V (Fig. 6.13). 

(4) The set T2 of trigonometric polynomials of the form 


a cos x + sinx 


forms a linear space. Similarly to (1), it is easy to see that the rule 


9--8-: a cos x + B sin x> —asinx + 8 cos x 


is a linear mapping 
2: T? > T». 


(5) Let A be a given т x n matrix from Р” х" and let X be ann x 1 
matrix, і.е., a column-vector. Pre-multiplying X by A constitutes the linear 
mapping 


XcR"*! + АХ € R™™? 


which associates every column-vector X in Р"! with a column-vector in 
R” x 1 





Fig. 6.13 


The mapping Z: V — V such that “x = x is called the identity 
mapping. 

An image of a linear mapping X: V > W is a set im of all vectors 
from W satisfying the following condition: a vector y in W is contained 
in im if there exists a vector x in V such that ox = y. 

Referring to the examples of linear mappings we mention the folowing: 

(1^) The image of the differential mapping Z: M, — M, is the set of 
polynomials of degree not exceeding и — 1. 


13—9505 
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(2^) The image of the similarity mapping coincides with the linear 
space V. 
(37) The image of the projective mapping 2: V — V is the subspace 


W = L(e;, £5, ass, ex). 


(4') The image of the differential mapping 2: T2 > T» coincides with 
the linear space T2. 

Proposition. The image of the linear mapping .2: V > W is a linear 
subspace of W. 
< Let y; and y? be vectors in im Æ This means that іп V there exist vectors 
xi and x2 such that Æx; = у; and x2 = Ул. 

From the formula 


№у + №уз = № Xi + № Ax = Ax + №) 


it follows that any linear combination of the vectors у; and yz also belongs 
to im o£ > 

The dimension of the image of the linear mapping is called the rank 
of the linear mapping. 

In symbols, we write rank (.27) to mean the rank of the linear mapping 
A. 

The linear mappings 7: V > W and Z: V >W are said to coincide 
if for any vector x in V there holds 2x = Zx. 

In symbols, we write .2/— @ to mean the coincidence of 2/ апа Z. 

Theorem 6.11 (on construction of a linear mapping). Let V and W be 
two linear spaces and e = (ei, ез, .., ел) be a basis in V, and let fi, 
f2, ..., fn be arbitrary vectors in W. 

Then there exists the only linear mapping 


a:VwW 
such that 
Aek = fy (k = 1, 2, ..., n). (6.13) 


First, we prove that a linear mapping satisfying (6.13) exists. 
“4 Consider the expansion of some vector x in V with respect to the 
basis e 
Р 
x= Y ek 


k=1 


and the mapping 27: V > W such that 


AX = Y ЕКЕ. (6.14) 
К-1 


6.10 Linear Mappings 195 


It is easy to verify directly that the induced mapping is linear. 
Let 


y= D ner. 
к-1 
Then applying (6.14) we have 


A(X + py) = 2. (МЕ + un fk 


=) X Efe tu У nf = x + py. 
k=1 k=1 
Whence we infer that 
We, = fk (К = 1, 2, ..., п). > 


Now we show that the linear mapping satisfying (6.13) is unique. 





Fig. 6.14 


ча Consider the mapping «7 satisfying (6.13) and the mapping Z: V > W 
such that 


Be =k (К =1, 2, .., n). 


Computing the actions of æ and @ on an arbitrary vector x in V, we 

arrive at the identity 
n n n 
Ax = 5 ЕЧ, = > E de, =@ ( M бе) = Øx. 
k=1 k=1 k=1 

Whence we conclude that the linear mappings .2/ апа 2 coincide. > 

From Theorem 6.11 it follows that we may define a linear mapping by 
identifying its action on the base vectors of V. 


13* 
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The kernel of the linear mapping æ: V > W is a set of all vectors of 
V which are mapped into the zero vector of 0, of W. 

We denote the kernel of © by ker £ 

Turning back to the examples (1-5) of linear mappings, we can easily 
notice that 

(1”) The kernel of the differential mapping 2: M, — М, is a set of 
polynomials of degree zero. 

(2”) The kernel of the similarity mapping consists of 6, only. 

(3") The kernel of the projective mapping Z: V > V is the linear sub- 
space L(ek+1, ex42, ..., ex) (Fig. 6.14). 

(4") The kernel of the differential mapping 2: T2 > Т» consists of 6, 
only. 

(5") The set of solutions of the homogeneous linear system 


АХ = 0 
form the kernel of the mapping 
DA: Rex! ay Rx! 


Proposition. The kernel of the linear mapping 47: V > W is a linear 
subspace of V. 
~ The identities 07x = 0, and Zy = 0, imply that 


AX + uy) = Ao Xx + р/у = №, + рд, = Oy >». 
The dimension of the kernel of the linear mapping £ is called the nul- 
lity of £ 
In symbols, we write dim ker <= nullity л 
Notice that for any linear space .2: V > W the following identity holds 
rank + nullity = dim V. (ж) 


Operations on linear mappings. Let 07: V > Wand Z: У > W be two 
linear mappings. 

The sum of the linear mappings £ and @ is the mapping €: V > W 
such that given any x in V 7 


€ Xx = fx + x. 
It is easy to verify that the mapping 4 is linear. Indeed, 


€ Qx + uy) = MK + py) + BAX + uy) 
= NA+ Ф) + щ y + By) =rOx + реу. 
In symbols, we denote the sum of Wand Z as = A+ Z. 
The product of the linear mapping л: V > W by an arbitrary number 
а is a mapping @: V > УУ such that given any x in V 
х = a AX. 
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The mapping . is linear since 
4 (Xx + uy) = о XY (Ax + py) 
= Xa x) + ще Wy) = A x + ру. 


In symbols, we write 4 = aX 


6.11 Linear Operators 


We shall confine ourselves to linear operators, that is, to linear map- 
pings from a space V to V. Notice that the differential mappings, the identi- 
ty and projective mappings and pre-multiplication of a square matrix by 
a column-vector that we have encountered in the previous sections are all 
examples of linear operators. 


Multiplication of linear operators. Let v~: V ^ V and 42: V ^ V be 


two linear operators. 
The product of the linear operator .0 by the linear operator - is the 
mapping 4: V — V such that 


C x = u (x). 
We show that 2 is a linear operator. Indeed 


ТОМ + py) = 4 (S (x + py) = 4 (Х.х + р у) 
= №. (ох) + uy) = х + u^ y. 


In symbols, we write б = 4.0% 
Remark. Notice that in general Z% 20 as is easily seen from the 


following example. 
Example. Let V = R? (Fig. 6.15). 
* The mappings 


of: (P, PY (4,0) and 4: (E, Р) o (€  £, Р) 


are linear operators from V into V. 
Then 


BA: (E, £) (E, 0) and 22: (E, £P) ^ E + £, 0). 
Whence we deduce that 

BAE, Р) + ANE, £) 
if £g #0, > 
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g? 





Fig. 6.15 


Definition. The linear operators 27: V > V and 2: V > V are said to 
be equivalent if given any vector x in V there holds 2х = x. 

Let л: V — V be a linear operator. 

The linear operator 2: V > V is said to be an inverse of æ if 


Bel = AB = F, (6.15) 


where Z: V > V is the identity operator, ie., 2х = x for any vector x in V. 

Theorem 6.12. For the linear operator £: V > V to be invertible it is 
necessary and sufficient that the image of «f coincide with the space V, 
ie, im = V. 
“4 Suppose, first, that there exists a linear operator Z inverse to the given 
linear operator .2/ | 

Recall that im 27 of 2/is a subspace of V. 

We show that an arbitrary vector y in V belongs to im Æ 

Let x = @y. By (6.15) we have 


Ax = A(By) = (С2)у = Sy = у. 


Whence we infer that the vector y is the image of the vector x = @y 
and, consequently, y belongs to im Æ Therefore, im £= V. > 

Now we suppose that the image of .2/ coincides with the space V, i.e., 
im .2/ = V. Then 


rank o&= dim V 
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and the linear operator .2 maps a basis of V into another basis of V, i.e., 
A: e= (ei, е2, ..., en) >f= (fi, Р, ө fn), 


where f, = «ey (k = 1, 2, ..., n). 
Consider a linear operator @ such that 


Фё = ек (К = 1, 2, .., n). (6.16) 
Theorem 6.11 states that the linear operator Z satisfying (6.16) is 
unique. 


Let us compute (Фо/х and (922 х for an arbitrary vector x in V. 
“4 Relative to the basis e the vector x takes the form 


x= X £*e,. 
k=1 
Using (6.16), we obtain 


(BZA х = (AY) ( x бе) = 2 ( Y бое) 
к= 1 к= 1 
= х Е d (o e.) = x t*Zf, = Y EK ex = X. 
= к= 


к=1 =1 


Analogously, relative to the basis f we have 


n 
x= > n* fx 
k=1 


and 


(242)x = У п (260) = У a Mea = У) тік =x. 
k=1 К-1 


к-1 
Thus for any vector x in V we have 
Фах x and Mex = х. 
Whence we conclude that 
BA — AB = E e 


Remark. Theorem 6.12 states that the inverse Z of æ is uniquely 
defined. 

In symbols, we write 077! to mean the inverse of the linear operator Æ 

Corollary. The linear operator 27: V — V is invertible if and only if 
the kernel of æ consists of the zero vector only, i.e., 


ker = (би). 


This immediately follows from Theorem 6.12 and formula (*) on 
page 196. 
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Example. The linear operator 
s eo (eie) 


compresses the plane uniformly (with coefficient 2/3) towards the axis e 
while the inverse operator 


AL: (e, Р) ә (e. 39) 


extends the plane uniformly with coefficient 3/2 (Fig. 6.16). 





6.12 Matrices of Linear Operators 


Suppose that the linear operator .2: V — V acts on the vectors ёл, 
ез, .., € of the basis e = (ei, ez, ..., ёл) in V so that 


Le; = > afek = ale, + ade; +... + ale, (i=l, 2, ...., n) 


к-1 
The matrix 
al oi a} 
A-Ao-| * * ой 
aï a On 


whose columns comprise the coordinates of the images of the base vectors 
is called the matrix of the linear operator æ relative to the basis e. 


2 
Examples. (1) Relative to the basis eg = 1, е = /, ez = 5 , 
3 
E the matrix D(e) of the differential linear operator 2: М; > Ms takes 


e3 — 
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the form 


D(e) = 


ооо о 
оо н- о 
оноо 


1 
0 
0 
0 


(2) Relative to the basis е = cos x, e; = sin x the matrix D(e) of the 
differential linear operator 2: Т, > T; takes the form 


»e-( 1 i) 


since 
Әе = Zcosx = -sin x = —e; = 0-ei + (-l1):e 
and 
Den, = Dsin х = cos х = е = Le; + O-e2. > 
Let 
y = AX 
and let 
n , n 
x= > {е and y= У) 7 ёс 
і= 1 k=1 


be the expansions of the vectors x and y relative to the basis e. 
Then the column-vectors 


3 т! 
x(e) = E and у(е) = d 
g n” 


which comprise the coordinates of x and y relative to the basis e are related 
as 


y(e) = A(e)x(e). (6.17) 
«4 Indeed, since the expansion of the vector y relative to the basis e is 
unique, comparing 
n 
у= X) тек 
k=1 
with 


у = Wx = b E! oe; = х 3 (È, afer) 
= > (X alt’) ex 


1-1 
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yields 
n 
n= > ой (K=1, 2, ..., n). 
1-1 
Arranging these n identities in matrix form, we have 
1 1 1 1 п 


1) a a2 a Qn 
2 2 2 2 2 
"n 23 ai a2 bey Qn Ё 
n n n n n 
7 «1 az te On Ё 


Whence follows (6.17). > 

Theorem 6.13. The rank of the matrix A(e) of the linear operator 
4: V > V is independent of the choice of the basis e and is equal to the 
rank of 82 
ча Since 


im = L( Mei, Lez, ...‚ LEen) 


the rank of æ is equal to the maximal number of linearly independent 
vectors belonging to the collection of the vectors Zei, oe», ..., LEen. Ву 
virtue of Theorem 6.4 this number is equal to the maximal number of 
linearly independent column-vectors in the matrix А (е), і.е to the rank of 
A(e). 

Therefore 


r(A(e)) = rank o >» 


It is easy to verify that addition and multiplication of the linear opera- 
tors become addition and multiplication of the matrices of these operators 
relative to the same basis and multiplication of a linear operator by a scalar 
becomes multiplication of the matrix of the operator by this scalar. 
ча Ву way of illustration we show that the matrix of = Zæ is equal to 
the product of the matrices A(e) and B(e) of the linear opera- 
tors æ and Æ relative to the basis e, i.e., 


C(e) = A(e)B(e). 
Let 


а/е: = У) akk and ек = У) blem. 
т=1 


Then 


Y, af Bex 


k=1 


5 at ( È ater) = È ( È sm) 


fei 


11 
К) 
5 
Г 
& 
=ч 
Ма 
g 
22 
! 
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Setting 
y= X Brak (i, m= 1, 2, ..., n) (6.18) 
k=1 


we may write 
C(e) = (yr). (6.19) 
On the other hand, since A(e) = (af) and B(e) = (87) (6.18) and (6.19) 
yield 
C(e) = B(e)A(e). (6.20) 


Thus, relative to the basis e, the matrix of the linear operator 4 18 
equal to B(e)A(e). > 

From the equivalence of the operation of multiplication of linear opera- 
tors and that of multiplication of the respective matrices it easily follows 
that the matrix of the linear operator .27 ^! which is inverse to the opera- 
tor æ is the inverse of the matrix A of «o 
“4 Indeed, by definition of the inverse operator we have 


a !a- fF and AAT! =f 

This means that the matrix of 2771, say B, must satisfy the identities 
ВА =I and AB-L 

Whence we conclude that B is the inverse of A, i.e, 
B = A`! > 


Theorem 6.14. Let £: У — V be a linear operator and let А = A(e) 
and A’ = A(e’) be matrices of « relative to the bases e and е’ of a linear 
space V. 

Then 


A’ = S^ !AS, (6.21) 
where S is the matrix of transition from e to e'. 
а Let y = AX. 
Relative to the bases e and e’ the column-vectors of the coordinates 
of x and y are related as 


y(e) = Ax(e) and y(e’) = A’x(e’). (6.22) 
By Property (b) of transition matrices we may write 
х(е) = Sx(e’) апа у(е) = Sy(e’). (6.23) 


Substituting (6.23) into the first identity of (6.22), we obtain 
Sy(e’) = ASx(e’). 


204 6. Linear Spaces and Linear Operators 


Whence, using the second identity of (6.22), we have 
SA'x(e') = ASx(e’). 
Notice that the above identity holds for any vector x. Hence we may 
write 
SA’ = AS. 
Since the transition matrix S is nonsingular and, consequently, inverti- 
ble, pre-multiplication of the last identity by S ^! gives (6.21). > 
Corollary. The determinant of the matrix of a linear operator remains 
unchanged in any basis. 
“<4 Let us compute the determinant of the matrix 
A(e’) = S^! A(e)S. 
We have 
det A(e’) = det (S^ !A(e)S) = det S~'-det A(e)-det S = det A(e), 
since 
det S^! = (det S) |. > 


It is easy to verify that the determinant of the matrix of the linear 

operator 
AL- 1E 

where //is the identity operator and t is an arbitrary number, also remains 
unchanged in any basis. 
ча Indeed, let A(e) — fl and A(e’) — ГЇ be matrices of the operator 
9/7 — t f relative to the bases e and e’. 

Using (6.21), we obtain 


A(e’) — tI = S^! A(e)S – И = S^ (A(e) - 116. 
Then by Corollary of Theorem 6.14 we have 
det (A(e’) — tD = det (A(e) – А). > 


Expanding the determinant 


a} —t а} ie о} 
2 рч 
det (A - m= | % 02-1 оп 
ол a ол = Г 


-(-1/Ч7 + yn" 71 +... + ү + уо, 
we can write 
x(t) = det (A — tD = (- "t" + ya-it" 7! + + yat + yos 
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The polynomial x(£) is called the characteristic polynomial of the linear 
operator .2/ and the roots of x(t) are called the eigenvalues of the linear 
operator ££ (the matrix A). Notice that the characteristic polynomial x(t) : 
is independent of the choice of a basis. 


6.13 Eigenvalues and Eigenvectors 


The nonzero vector x in V is called the eigenvector of the linear 
operator 7: V — V if there exists the eigenvalue ^ of æ such that 


Ax = №. (*) 


Examples. (1) Any polynomial of the zero degree is the eigenvector of 
the differential operator 


2. 
= 9; ` М, > Ma. 


<4 Consider а polynomial of the zero degree, which is equal to 1. Then 
applying (*), we have 
а m- 2 
"E (020x120 


Whence it follows that the eigenvalue А of Z is equal to zero. > 
(2) The differential operator 


2-2. Т; > T; 


has по eignvectors. 
ча Consider the trigonometric polynomial а cos ¢ + @ sin f. 
Using (*), we have 


2 (a cos t + B sin t) = А (а cost + В sin t). 
This means that 
— a sin і + @ cos t = № cos t + № sin f, 
Or 
(8 + a) sin t + (Хе — 8) cos t = 0, 
which is fulfilled if and only if 
а430-0 апа № – 8-0. 


Whence a = 8 = 0. Consequently, the polynomial o cos £ + В sint is а 
zero one and can not be the eigenvector of Z. » 
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Theorem 6.15. The number is the eigenvalue of the linear opera- 
tor £ if and only if \ is the root of the characteristic polynomial x(t) 
of Hie. Х(№ = 0. 

“8 Suppose that А is the root of the polynomial x(f) in which case 


xA) = det (A(e) — №) = 0. (6.24) 
Consider the homogeneous linear system 
Е 0 
2, 
a9 -3»[ 51-19 
P] No 
Or 
(ol — J£ 4028 oc ol = 0, 
ast} + (o$ == ХЕ хийг тын azg" = 0, (6.25) 
alt} + a3? +... + (ал -NE = 0. 


By virtue of (6.24) system (6.25) has the nonzero solution £!, Ё, ..., 
£" and 


x= > t'e; z b. 
1-1 


The column-vector x(e) of the coordinates of х satisfies the condition 
(A(e) — М)х(е) = 0, 
which may be written as 
A(e)x(e) = Xx(e), 
or, equivalently, 
Ax = №. 
Whence it follows that Х is the eigenvalue of the linear operator .2/ and 
х is the eigenvector of £ 
Now we assume that is the eigenvalue of £ Then there exists a non- 


zero vector x such that ox = ^x. 
Let e = (ei, ег, ..., ел) bea basis. Then we may write the matrix equation 


A(e)x(e) = Xx(e), 
Or 
(A(e) — ADx(e) = 0. (6.26) 
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Since x is the eigenvector of othe column-vector x(e) of the coor- 
dinates of x is a nonzero vector. This means that system (6.26) has a nonzero 
solution. In this case the condition 


det (A(e) — AD = 0 


must be fulfilled. 
Thus 


x0) = 0. > 
Remark. To find all the eigenvectors corresponding to the given eigen- 
value А it is necessary to construct the fundamental system of solutions 
for (6.25). 





Fig. 6.17 


Examples. (1) Let 
g;viov 
be a projective linear operator such that 
P: xi + yj + zk 2 xi + yj. 


Compute the eigenvectors of Z 
~ Let us consider the action of & on the base vectors i, j and К (Fig. 6.17). 


We have 
2?2: i>i, 2:jj and 2: k8. 
Since the matrix of 2 is 


1 0 0 
Р-(10 1 0 
0 0 0 
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the characteristic polynomial of # becomes 


1-3 0 0 
0 1-3 0| --3Х-1) 
0 0 =À 








with roots ^; = 0 and №, №; = І. 
Consider the homogeneous linear systems corresponding to the distinct 
values of ^ equal to 0 and 1. These are defined by the coefficient matrices 


100 0 0 0 
010 and 0 0 0 
ооо 0 0 -1 


respectively. Then we may write the homogeneous linear systems as 


x =0, 0=0, 
y =0, and 0-0, 
0-0 —2 = 0. 


The corresponding fundamental systems of solutions are 


8-0 0 


Whence we deduce that the eigenvectors of # are the base vector k and 
any vector of the form xi + yj (x2 + у? > 0) associated with the eigen- 
values 0 and 1, respectively. > 

(2) Let 

2-4. a+ Bt y — 8 + 2yt 
be a differential linear operator defined over a space of polynomials of 
degree not exceeding 2. 

Compute the eigenvectors of 2. 

+4 Relative to the basis 1, 7, £? the matrix D of 2 takes the form 


010 
D={ 0 0 21. 
000 


Then the characteristic polynomial —À? = 0 has the root 4 = 0 of mul- 
tiplicity 3. 
Consider the homogeneous linear system 


8-0 
2y = 0 
0= 0 


whose solution is 1, 0, 0. The triple 1, 0, 0 corresponds to the polynomial 
of the zero degree which is the eigenvector of 2. » 
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6.14 Adjoint Operators 


In a finite-dimensional Euclidean space a linear operator can be 
subjected to an operation which gives rise to a linear operator called the 
adjoint operator. 

Let V be an n-dimensional Euclidean space and 47: V > V be a linear 
operator. 

We shall associate with л а linear operator o7": V — V such that for 
any vector x and any vector y in V there holds 


(x, у) = (x, 27). (6.27) 


Definition. The linear operator .0*: V — V satisfying (6.27) is called 
the adjoint operator of the linear operator o V > V. 

Let us show that for any linear operator æ there always exists the ad- 
joint operator .0*. 
ча Let e = (ej, €, ..., ел) be an orthonormal basis of V and A = 
A(e) = (od) be a matrix of the linear operator 2 relative to e, in which case 


Ой, = Dale; (i 2-2: х n). (6.28) 
j=1 


Transposing the matrix А and setting B = A’, i.e., Bi = ad for any i 
and any j (i, j = 1, 2, . . ., п), we may define the linear operator 47”: V > V 
such that 


ме = > Ble; (і = 1, 2, ..., п), (6.29) 
Ј= 1 
where 
Ві = а! апі і ј = 1, 2, ..., п. (6.30) 


(Recall that by virtue of Theorem 6.11 a linear operator is uniquely defined 
by its action on the base vectors of V.) 

To verify (6.27) we first set x and y equal to the base vectors, say, x = e; 
and у = ej. 

Since the basis e is orthonormal (6.28) gives 


(oe, е) = (2 afek, «) = >; оек, е) = >) ок) = ої. 
К-1 К-1 К-1 
Analogously, using (6.29) and (6.30), we obtain 
(ei, 27" ej) = (s. > bfer) = У phe: ex) = D бк = Ві = ol 
к-1 k=1 k=) 
Whence 
(Ze, е) = (е, 07е) (Lj-21,2,... n) (6.31) 


14—9505 





Now we suppose that x and y are arbitrary vectors in V. Relative to 
the orthonormal basis e the expansions of x and y are 


x = >) ee; and y = У) rej. 
і= 1 j=1 


Then computing the expressions on the left- and right-hand sides of 
(6.27), we have 
(ox, у) = (s (dee : Zve) = х YR (а/е, ej) 
i- 121 


і=1ј=1 


апа 


% any) = (Уве, an (Zve) ) = ZXtre. аер. 
1-1 -1 і=1ј= 
Whence by virtue of (6.31) we arrive at the desired result. > 
Example. Consider a two-dimensional linear space of polynomials of 
degree not exceeding 1 with real coefficients. 
Let 


e(t) = а + bt and (1) = c + dt 
be any two polynomials of degree not exceeding 1 and let 
(e, №) = ac + bd (*) 


be a scalar product of e and y. 

Then a two-dimensional space of polynomials of degree not exceeding 
1 with real coefficients becomes a two-dimensional Euclidean space Mi. 

Let Z: М, > M; be a differential operator such that 2 (a + bt) = 

We can define the adjoint operator 2": M; > Mi as follows. 

Notice that the polynomials 1 and ¢ form the orthonormal basis of My 
Since using (ж), we obtain (1, 1) = (£, t) = 1 and (1, £) = 0, in which case 
Z (1) = 0 and 2 (f) = 1, and the matrix of Z becomes 

0 0 

Then 10 
21) = t and 2) = 0. 

For an arbitrary polynomial ф(/) = a + bt we have J: a + bt > b and 
2*: a+ bt at. 

Properties of adjoint operators. (a) Every linear operator has a single 
adjoint operator. 
“5 Suppose that Z and аге two distinct adjoint operators of a given linear 
operator .2/ Then for any x and any y in V there hold _ 

(2/х, у) = (x, Zy) and (x, у) = (x, €y). 


0 
0 0/ 
is the matrix of the adjoint operator 2* such that 
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Whence 
(х, y) = (x, Су) 


and 
(x, у — Фу) = 0. 


Since х is an arbitrary vector in V we infer that the vector Фу — (у 
is orthogonal to every vector in V and, consequently, to itself. The latter 
implies that Zy — y = 0 and Zy = Фу. Hence, Z = c since y is an ar- 
bitrary vector in V. > 

From Property (a) it immediately follows that 

(b) (ам) = ao", where a is an arbitrary real number; 

(c) (+B) = x" + 4 

(d) (ABY =A"; 

(е) (07) = £ 

We shall also mention the other two important properties of adjoint 
operators, namely 

(f) Let e be an orthonormal basis of V. Then for the linear operators 
A: V —^ V and Z: V > V to be mutually adjoint, ie, to satisfy both 
B= A£" and £= Z" it is necessary and sufficient that relative to the basis 
e the matrix of one of the operators, say B = B(e), be obtained by transpos- 
ing the matrix A = A(e) of the other operator so that B = A’. 

Notice that this property holds true only if A and B are matrices ar- 
ranged relative to an orthonormal basis and violates otherwise. 

(g) If a linear operator æ is nonsingular so is the adjoint operator .27* 
of æ and 


(7 = (n). 


6.15 Symmetric Operators 


A linear operator is said to be symmetric (or self-adjoint) if £ 
is identical to its adjoint operator 2/* so that 47-27 

Notice that by virtue of Property (f) the matrix of a symmetric operator 
relative to an orthonormal basis is also symmetric and remains unchanged 
when being transposed. Therefore a symmetric operator is adjoint to itself 
and may also be called the self-adjoint operator. 

Example. Let 2 be a linear operator that defines an orthogonal projec- 
tion of a three-dimensional Euclidean space with the induced Cartesian 
coordinate system Oxyz onto the xy-plane (Fig. 6.18). 

Relative to the orthonormal basis i, j, К the matrix of 2 takes the sym- 
metric form 


14* 


с up PP —їЄ— ————— 
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о о -– 
O- о 
ooo 


since Zi = i, Aj = j and ФК = 6. This means that the operator Fis sym- 
metric. 





Fig. 6.18 


Properties of symmetric operators. It is worth mentioning some remark- 
able properties of symmetric operators. The first two properties given below 
are the direct consequences of the definition of a symmetric operator. 

(a) For a linear operator 47: V > V to be symmetric it is necessary and 
sufficient that for any vectors x and y in V there holds 


(x, у) = (x, £y). (6.32) 


(b) For a linear operator to be symmetric it is necessary and sufficient 
that relative to an orthonormal basis its matrix become symmetric. 

(c) The characteristic polynomial of a symmetric operator (and the as- 
sociated symmetric matrix) has only real roots. 

Recall that any real root ^ of the characteristic polynomial is the eigen- 
value of the corresponding linear operator .2/, і.е., there exists a nonzero 
vector x (the eigenvector of £) such that Æx = ^x. 

(d) The eigenvectors of a symmetric operator corresponding to distinct 
eigenvalues are orthogonal. 

48 Let x; and x2 be the eigenvectors of æ so that 7x; = dix, and 
Axı = №х›, and let M ¥ №. 
Since & is a symmetric operator we have 
(2х1, X2) = (X1, 2x2). 
On the other hand, 


(X1, X2) = (№), x2) = (xi, хэ) 
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and 
(х1, 4 X2) = (xi, A2X2) = MAX, X2). 
Whence 
Х(Х, X2) = XQ, X2) 
and (Ai — X)(xi, Хо) = 0. 
Since №, — № = 0 we arrive at 
(х2, X2) = 0. > 
(е) Let 27: У > V be а symmetric operator. Then in V there exists an 
orthonormal basis e = (ei, ёо, .. ., ёл) comprising the eigenvectors of 27 
so that 


Ae; = Ne; (i = l, 2, TE n), 
(ei, ej) = бу (i j = 1, 2, og n). 


Turning back to the previous example we easily see that the triple 
(i, j, k) is the desired orthonormal basis in V since the vectors i and j are 
the eigenvectors of Z corresponding to the eigenvalue 1 (of multiplicity 2) 
and k is the eigenvector corresponding to the eigenvalue 0. 

(f) If a nonsingular operator 47: V > V is symmetric so is its inverse 
2 У-үу, 

Remark. All the eigenvalues of a nonsingular operator are distinct from 
zero. Indeed, if А #0 is the eigenvalue of a nonsingular operator 4, 
then 1/A is the eigenvalue of the inverse operator æ ^ !. 

We shall say that a symmetric operator s% is positive if given any non- 
zero vector x in V there holds (0х, x) > 0. 


Properties of positive operators. (a) A symmetric operator £: V > V 
is positive if and only if all the eigenvalues of 2/ are positive. 

(b) A positive operator is nonsingular. 

(c) If an operator is positive so is its inverse. 


6.16 Quadratic Forms 


Let A = (ац) be a symmetric matrix of order n so that aj = aij. 
Then the expression 
n n 
У Daye? (6.33) 
і=1ј=1 
is said to be the quadratic form in the variables £, £,..., Е". The matrix 
A is called the associated matrix of the quadratic form. 

The quadratic trinomial ax? + 2bxy + cy?, where a, b and c are real 
numbers, serves as an example of the quadratic form in two variables x 
a b 

c 


and y, the associated matrix being b 
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The n-tuple of numbers Ё, #2, ..., £" may be regarded as the coor- 
dinates of a vector x in an n-dimensional real space V relative to a given 
basis so that 

x = fe, + Pe; +... + Men, 


where е = (e1, €2, ..., ёл) is the orthonormal basis of V. Then (6.33) be- 
comes a numeric function of a vector-valued argument x defined over the 
space V. This function is customarily written as 


a (x, x) РЕ (6.34) 


We shall also say that the function æ (x, х) is defined in an n-dimensional 
Euclidean space V. 
We may also associate with any quadratic form (x, x) the bilinear 
Jorm 
n n 2 
a (x, y) = рэр atm, (6.35) 
izi= 
where n}, 12, ..., 7" are the coordinates of the vector y relative to the 
orthonormal basis е = (e1, ez, ..., ёл) so that 


у = е + yen +... + n'en. 
The form (6.35) is called bilinear since it is linear in both the argument 
x and the argument y so that 


A (алх + 022, y) = X (X, y) + a2 (x2, y) 
and 
A(x, Biy1 + B2y2) = Bio (X, ул) + B2 (x, уз), 


where ол, аг, В; and 8: are arbitrary numbers. 
The bilinear form (6.35) is symmetric since its value is independent of 
the order in which x and y occur in (6.35), i.e., 


Ay, x) = L(x, y). 
Computing the value of æ (x, y) for the base vectors, i.e, for x = ex, 
y = em, we obtain 
A (ек, ем) = акт. (6.36) 
Whence it follows that the elements of the associated matrix A of the quad- 
ratic form (6.34) are the values of the bilinear form computed for the vec- 


tors of the basis e. 
The scalar product of two vectors in an n-dimensional coordinate space 


(§, т^ = En Bye t+... + Ev, 
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where £ -(Б,8,...8)6Є R” and q = (9, 72, ..., т") € R^, is the bilinear 
form. 
The associated quadratic form 


| 2-4, р = £y + (y н... (Е) 


defines the square of the length of the vector Ё. 

The coordinates of a vector x relative to a different basis are different 
and so is the matrix of the quadratic form. 

In a variety of applications we need to simplify a quadratic form by 
converting it to a diagonal or normal form. 

A quadratic form is said to be of diagonal form if the coefficients in 
all £'-£ are equal to zero. In other words, a quadratic form s (x, x) is of 
a diagonal form if aj; = 0 for all i # j and 


of (x, x) = ou(E + az (£y +... + ал (Ё). 


The associated matrix is also diagonal, i.e., 


“11 0 
E o2... s) 


Theorem 6.16. For any quadratic form defined over a Euclidean space 
there exists an orthonormal basis relative to which the associated matrix 
becomes diagonal. 
ча To prove this theorem we shall use the arguments that follow from 
properties of symmetric operators. 

We choose the orthonormal basis e = (ei, ег, . . ., ёл) and consider the 
linear operator 47: V — V such that, relative to e, the matrix (al) of £ 
is identical to the matrix (aj) of a given quadratic form, ie, oj = aij. 

Since (oj) is symmetric so is the operator o 

Let us compute (2x, x). Since the basis e is orthonormal we have 

(Zei, еј) = od = ay, 
and 


(x, x) = (x (Zee), bo «) 
= 2 


Ma 


LEE Ger, ej) = » Seyi = A(X, x). 


i-lj-1 


J 


Whence we infer that the quadratic form 27 (x, x) defined over a Euclidean 
linear space V and the symmetric operator acting in V are related as 


A (x, X) = (Zx, x). (6.37) 


Recall that for any symmetric operator and, in particular, for æ in V 
there exists an orthonormal basis f = (f1, f2, . . ., fn) comprising the eigen- 
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vectors of «so that 
A fk = Akfk (k = 1, 2, э cag ny (fk, fm) = Ókm. (6.38) 
Notice that 
(X fr, fm) = (\кїк, fm) = Хкбкт = { 
Substituting the expansion of x 
x= > n fx 
К-1 
relative to the basis f = (fi, f2, ..., fn), into (ох, x), we have 
(97x, x) = (У( У; ч), 23 42) 
К-1 m=! 


D nV fe, fm) = У мо). 


К-1т-1 


Whence, (6.37) yields 
M (x, х) = E NMa Y. (6.39) 
kzl 


Thus, the matrix A(f) of the original quadratic form becomes diagonal 
relative to the basis f, so that 


A(f) - (09-44) 


We may convert a quadratic form to diagonal form without making 
explicit computations of the base vectors of f. It suffices to compute the 
eigenvalues of the corresponding linear operator or, equivalently, the eigen- 
values of the associated matrix A = (aj) counted with their multiplicities. 

Example. Reduce the quadratic form 


М (К = m), 
0 (kzm) 


A (X, x) = 2xy + 2yz + 2xz 


to the diagonal form. 
ча The associated matrix is 5 


0 1 1 
1 0 1]. 
110 


To find the eigenvalues we must solve 
—^ 1 1 
1 -^ 1 
1 1 -À 
yielding М = 2 and №з = —1. 


--У-3-2-0 
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Thus we have 
OW (x, x) = 2%? – у? – 2°. 


It is much harder to compute the desired orthonormal basis. To this 
end we shall find the eigenvectors of the symmetric operator ./ that are 
identical to the eigenvectors of the matrix of the quadratic form .«/ (x, x). 

Let А = 2. Consider the homogeneous linear system specified by the 
coefficient matrix 


-2 1 1 
1 -2 1 }. 
1 1 -2 


We have 
—2x+ y+ z=0, 
х-2у- z=0, 
x+ y-2z-0. 


All solutions of the system are proportional to the vector (1, 1, 1)’. 





А шоо 1 1 ry 
Hence, the unit vector is i = , 5 Ё 
( v3’ Үз, үз ) 


Їс13 = —1. The homogeneous linear system defined by the coefficient 
matrix 


111 
111 
111 


has two linearly independent solutions and we have to choose them so that 
they become orthogonal. 

The system reduces to the single equation x + y + z = 0. Then the 
desired solutions are (1, —2, 1)' and (1, 0, —1)’ and the unit vectors are 
1-(1/ 6, —2/v6, 1/V6)’ and k = (1/72, 0, —1/v2)’. 

It is easy to verify that both the vector j and the vector k are orthogonal 
to the vector i. (Notice that this result also follows from Property (d) of 
a symmetric operator.) 

Then the desired orthonormal basis comprises the vectors 


‚„ 





Remark. We may accept any n-dimensional Euclidean space as V. 
However, of practical interest is a coordinate space IR" whose elements are 
all possible ordered n-tuples of real numbers ¢ = (Ё!, £, . . ., Ё"). The basis 
of R” comprises the vectors (1, 0, ..., 0, 0), (0, 1, ..., 0, 0, ..., (0, 0, 
..., 1, 0) relative to which the scalar product of two vectors & = (#1, £, 
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..« £) and у = (7, зү, ..., 9”) is given by the formula 
(Е, = £n + £v +... + E 
We shall describe the procedure that enables us to choose the basis rela- 


tive to which a given quadratic form specified over an n-dimensional coor- 
dinate space becomes diagonal. 


а Let (x, x) = У) J оу? be a given quadratic form. 


i=1j=1 


Step 1. Write down the associated matrix 


Q11 O12 ..-. Qin 
Q21 022 ... Qn 
Олі Qn2 Qnn 


Step 2. Solve the polynomial equation 


ay—f Q2 20253 Ойл 
ол 022-1... On —0 
Ол Оїл2 Onn — t 


yielding the eigenvalues of the associated matrix of (x, x). 
Arrange the eigenvalues № € №... € An, their multiplicities counted. 
(Notice that all the eigenvalues are real since the matrix is symmetric.) 
Step 3. Let \ be a root of multiplicity К. Then the homogeneous linear 
system specified by the coefficient matrix 


аі — À 012 ... Qin 
Q21 22 — Noves Q2n 
Oni Qn2 Qnn — № 


has exactly k linearly independent solutions that form the fundamental sys- 
tem of solutions. On normalizing the solutions we obtain k pairwise unit 
vectors. 

Repeating this process for the other eigenvalues, we obtain exactly n 
pairwise orthogonal unit vectors that comprise the orthonormal basis f1, 
12, . . ., fn of R”. Notice that the vectors corresponding to the distinct eigen- 
values are orthogonal by virtue of Property (d) of a symmetric operator. 

Step 4. Write down (x, x), relative to the basis f = (f1, f2, . . ., fn), 
in the diagonal form 


& (x, x) = №0) + yy +... + №), 
where x = n'fi + nfz +... + E. 
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Definition. The quadratic form 


ax, x) = Ў Хайр (6.40) 


i=1j=1 
is called positive-definite if given any nonzero vector x or, equivalently, 
given any nonzero n-tuple #1, #2, ..., £", there holds 

4 (x, x) > 0. 


The scalar square of an arbitrary vector £ = (El, #2, ..., £") of an n- 
dimensional coordinate space given by the formula 


E, 9 = E+ yv... gy 


is an example of a positive-definite quadratic form. 
On reducing the positive-definite quadratic form æ (x, x) to the di- 
agonal form, we have 


& (x, x) = №01)? + №2) +... 444179, 


where; > 0, № > 0, ..., AW > 0. 

Criterion for a quadratic form to be positive-definite. The quadratic 
form æ (x, x) is positive-definite if and only if the leading minors of the 
associated matrix æ% i.e., the minors cut out of the left-hand upper corner 
of % are all positive, i.e., 








Q11 012 Qik 
O11 Q 221] O22 O2k 
an > 0, |“ 912 , Бае А уы, > 0, 
012 022 ў 
Qik A2k Okk 
Qil o2 Qin 
Q12 022 Q2n 
аза сана ө: жоё, ж.ж > 0. 
Qin Q2n Onn 


Diagonalization of a quadratic form by completing the square. We shall 
explain the procedure which is useful to convert a quadratic form to a di- 
agonal form and, in particular, to decide on definiteness of quadratic 
forms. 

* Let 
n n : 
A(x, x) = 2; 2joyE Ë 
(-1/-1 
be a given quadratic form and let a1 # 0. 

By a simple algebra we can reduce the sum of all terms involving Ё! 

to the form 
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anl) + 2428 EP +... + 2ang t" 
12 2012 „1,2 Qin n 
an (tt +2 m кё+...+2 е pe) 





2 n n 
= an (# жар yg оа p) уу, анау ну 
Q11 Q11 i=2j=2 Qi1 
Setting 
qa ey нь Py Se 
O11 O1 


gat! (k=2,3,... n), 
we obtain 
A(x, х) = ат)? + 2 PELLE 
i-2j- 


where 
Qij 


anim “11 


We look now at 
n n ХЭЛ 
24 (х, x) = 2; D aim. 
i-2j-2 
It is easy to see that .24 (x, x) is a quadratic form in (n — 1) variables and 
can also be represented as the sum of the square of one variable and the 
quadratic form in the other (n — 2) variables. Thus, repeating this process 
of “completing the square" we finally arrive at the desired diagonal form 
of (x, x). 
Поп = 0 but ay (2 < i < n) is distinct from zero, we start the process 
by completing the square of £. 
Now we suppose that in (x, x) all the coefficients in squares of £' 


(21,2,..., n) are equal to zero, ie, 011 = а =... = ан = Qnn = 0. 
Then by the substitution 

=з! + тр, 

2 1 


7, 
E = ok (k = 3, 4, ..., n) 
the quadratic form (x, x) is reduced so that we again have the general 
case. Indeed, by this substitution the term 20:12 12 is reduced to 2a12(71)? — 
201200). > 
Example. By completing the square reduce the quadratic form 
A(x, X) = 2ху + 2yz + 2zx. 
to the diagonal form. 
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ча By the substitution 
x=ã+f, у-Х-У/, 1-2 
З (х, х) is reduced to the form 
A(X, x) = 222 — 2y? + 40 = UË + 2xz) – 29? 
= 26 + zy — 29? - 227, 
Set X=X¥+Z ў = у, 2-2 Then 
A(x, х) = 2% — 29? – 22. > 


Remark, The major shortcoming of the process of completing the 
square is that it involves the trasformations of coordinates which are not 
orthogonal, that is, the new coordinates taken in pairs are not orthogonal. 

On comparing the diagonal forms of 2xy + 2yz + 2zx obtained by ap- 
plying the procedure that involves identification of an orthonormal basis 
and the procedure of completing the square we easily see that in both cases 
the number of positive terms remains unchanged and so does the num- 
ber of negative terms. This is an important property of quadratic forms 
called the /aw of inertia which states that for any quadratic form the num- 
ber of positive terms remains the same in all its diagonal forms and so 
does the number of negative terms and the number of zero terms. Thus 
these numbers are independent of procedures applied to reduce a given 
quadratic form to a diagonal form. 


6.17 Classification of Curves and Surfaces 
of the Second Order 


We are now well equipped to turn back to the analysis of the general 
equations of curves and surfaces of the second order which we have en- 
countered in Chap. 4. 


Plane curves. The general equation of the second-order curve in the 
xy-plane is 
ах? + 2bxy + су? + 2dx + 2ey + f = 0, 


where а? + b? + c? > 0. 
The associated matrix of the quadratic form ax? + 2bxy + cy? is 
(5) 
bcj’ 
Computing the roots №; and M of the characteristic polynomial and 


the corresponding orthonormal eigenvectors i and j, we can use i and j 
as the unit vectors of the new coordinate axes, the X- and j-axes, respectively 
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(Fig. 6.19). Then the original equation becomes 
м + ay? + 218 + 267 + f = 0. 


Two cases have to be distinguished: (1) \1-A2 # 0 and (2) either ^; or 
№ is equal to zero. 
(1) By the translation 


Jee ad Е. 
AERES? Куур 


the equation is reduced to the form 
МА XY? + f- 0. 





Fig. 6.19 


In a way similar to that we have followed in Chap. 4 we consider all 
possible sequences of signs of А, \2 and f'and finally arrive at the equations 
which specify an ellipse, a hyperbola, a pair of intersecting lines, a point 
and an empty set in the xy-plane. _ 

(2) For definiteness we set ^; = 0 and Mz # 0. Then by the translation 

X=fta Y-je-f- 
№ 
the equation 

y? + 2d$ + 269 + f=0 
is reduced to the equation 

ho Y? + 2dX + f= 0. 
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If d = 0 we put a = at , thus arriving at the equation of a parabola 


MY? + 2dX = 0. 
If d = 0 we put a = 0, thus obtaining the equation 
NY + f=0 
which specifies either a pair of parallel lines or a pair of coinciding lines 
ог an empty set corresponding to different signs of f/X;. 
Remark. Computations of the roots of the characteristic polynomial 
and the corresponding orthonormal eigenvectors are used here instead of 


a suitable rotation of coordinate axes employed in Chap. 4 to eliminate 
the term 2bxy from the general equation. 


Surfaces of the second order. The general equation is 


оліх? + 2012Ху + 2013Х2 + азу? + 2orsyz + a332? 
+ 2014Хх + 2a24y + 20342 + ом = 0, 


where o2, + od; + at + а, + аз + оз > 0. 
To simplify the quadratic form involved in the general equation we con- 
sider the associated matrix 


Q1 G12 Q13 
Q12 022 Q23 
Оз 03 033 


and compute the roots Ху, № and ^; of the characteristic polynomial 


ОШ = Ё ap 913 
azr 022-1 an 
Q13 023 033 — Í 


Let i, j and k be the orthonormal eigenvectors of the associated matrix. 
We accept 1, ў, Е as the unit vectors of the new coordinate axes, the X-, 
J- and Z-axes, respectively, relative to which the general equation takes the 
form 


Мм + hoy? + №22 + 2014 + 2004 + 20342 + ода = 0. 


Three cases have to be distinguished: 

(1) All the three roots Ху, №, ^s are distinct from zero. 

(2) Only one root is equal to zero (for definiteness we set ^s = 0). 
(3) Two roots are equal to zero (for definiteness we set Аз = № = 0). 
Remark. All the roots Ху, №, Аз are never equal to zero simultaneously. 
We shall consider these three cases separately. 
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Case I. By the translation 


O14 л Q24 2 034 
= es = + 
MC Y=yr VEL 2-2 х 


the general equation is reduced to the form 
MX? MY? + №2? + ад = 0. 
(i) If ац # 0 and 


_ (a) ИМ, № апа № are of the same sign which is opposite to that of 
044, We have the equation of the ellipsoid 


x? Y* z? 


Х=ў+ 




















+ + =f, 
a? b с? 
2 0:44 2 Q44 2 Q44 
where a^ = — be = – с = – 
№ № , \з 


E (b) If the signs of ^; and № are opposite to that of œ44 and ^3 and 
сад are of the same sign, we have the equation of the Ayperboloid of one 
sheet 

X^. Y^. Z 











PP AENEA 
а? b? c э 
а а а, 
where d? = — 44. , p? = -M 2. TM 
М À2 A3 


(c) If ^i, № and 044 are of the same sign and the sign of ^; is opposite 
to that of саа, we have the equation of the Ayperboloid of two sheets 














+ = -1, 
2 b? c? 
2 044 2 Q44 2 O44 
where a^ = Ь° = : = – 
м”, X M 


(ii) If Q44 = 0 and 

(а) If X1, № and ^, are of the same sign, the equation defines the point 
(0, 0, 0) in space. 
(b) If any two roots are of the same sign which is opposite to the sign 
of the third root, we have the equation of the cone of the second order 
2 2 2 
X ҮТ Z 
a 


2 b? c 


Case 2. By the translation 








Х=ў+-4&,ү=ў+-54 252 
¥+ M ў x 2 
the general equation is reduced to the form 


MAX? + XY? + 20342 + ац = 0. (ж) 
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(i) If оза = 0, then by the translation 
x-X,y-Yz-ZzZ-e = 
20:34 





the equation (*) is reduced to the form 
Ax? + hoy? + 20344 = 0, 


in which case 
(a) If ^; and X; are of the same sign, we have the equation of the elliptic 
paraboloid 


2 2 
Эа эж TN 
p а - 
where p = => 8 and q = зыб 
1 2 


Notice that we have assumed that the sign of a34 is opposite to that 
of ^; апа \. We can always make this assumption by reflecting the z-axis, 
if necessary. 

(b) If ^, and № are of the opposite signs, we have the equation of the 
hyperbolic paraboloid 


2 2 
e 
where p — - > 0 and q = +4 > 0. 
1 2 


(й) If a34 = 0 then the equation (*) becomes 
МХ? + №Ү + оза = 0, 
which defines а family of cylindrical surfaces whose directing lines lie in 
the XY-plane and are given by the equation 
МХ Y? + ад = 0. 
The following table shows the classification of the corresponding cylin- 


drical surfaces. 
Case 3. By the translation 


X=KX+ Ae ,Y-RZ-z 





the general equation is reduced to the form 
xXx? + 2824Ү + 2034Z + Q44 = 0. 


(i) If оо z 0 and a34 z 0 then we can always reduce this equation to 
the form with o4 z 0 and аза = 0. It suffices to transform the coordinates 


15—9505 
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Table 6.1 







Elliptic 
cylinder 







Empty set 






Hyperbolic 
cylinder 








Pair of 
intersecting lines 





as 
P a2 Y + aZ = аз Ү- 0242, 


3-Х, $j-———.i- = = 
V оа + ada Vaza + 04 
The equation becomes 
X1? + 289 + ад = 0, (*) 


where 8 = Мад + oda . 


Notice that we have to choose the transformation of coordinates such 
that the new system of coordinates will be the Cartesian one. 
By the translation 





x=% у=ў+ 54 





2-2 


28 ' - 
the equation (*) is transformed to the equation of the parabolic cylinder 
x? = 2py. 
(1) If o4 = азд = 0 we have the equation 
iX? + ад = 0, 


which defines a pair of parallel planes when M04 < 0, a pair of coinciding 
planes when ола = 0 or an empty set when ^::044 > 0. 
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Exercises 


1. Define the linear span generated by the polynomials 1 + ¢?, 
гъ, lteter’. 
2. Determine whether the vectors xi, Хо, X3 are linearly dependent or not: 
(а) xı = (l, 2, 3), x2 = (4, 5, 6), хз = (7, 8, 9), (b) xı = (1, 4, 7, 10, xo = 
(2, 5, 8, 11), x3 = (3, 6, 9, 12). 
3. Show that the vectors x; = (1, 1, 1), x2 = (1, 1, 0), хз = (0, 1, —1) form 
the basis of the linear space Р?. 
4. Complement the collection of two vectors (1, 1, 0, 0) and (0, 0, 1, 1) 
to get the basis of the linear space Р“. 
5. Verify that the vectors (2, 2, —1), (2, —1, 2), (- 1, 2, 2) form a basis 
of the linear space Р? and find the coordinates of the vector x = 
(3, 3, 3) relative to the basis of Р?. 
6. Define the dimension and a basis of the linear span generated by the 
vectors x; = (1, 2, 2, -1, x2 = (2, 3, 2, 5), хз = (-1, 4, 3, -1, x4 = 
(2, 9, 3, 5) of the linear space №“ 
7. Compute the angle between the vectors (2, —1, 3, —2) and (3, 1, 5, 1) 
in the Euclidean space Р. 
8. Apply the procedure of orthogonalization to the vectors (1, —2, 2), 
(—1, 0, -1) and (5, —3, —7) іп №. 
9, Let L be a subspace generated by the vectors (1, 3, 3, 5), (1, 3, —5, —3), 
(1, —5, 3, —3). Find the L-component of the vector x and the orthogonal 
complement of x with respect to L if x = (2, —5, 3, 4). 
10. Let —/ be an operator defined over the 3-dimensional Euclidean space, - 
such that x = (x, a)a, where a is a given vector. Prove that . is a linear 
operator. 
11. Let ./ be a linear operator that maps an arbitrary vector x = (x', x’, 
х?) as ox = (2x! — x? — x3, x! — 2x? + x, x! + x? — 2x3). Find the 
image, kernel, rank and nullity of .24 
12. Find the matrix of the differential operator defined over the 
2-dimensional linear space generated by the base functions 


g(t) =e'cost and y(t) = e'sint. 
13. Let 


00 1 
0 1 0 
100 


be a matrix of an operator .o relative to the basis 1, t, t^ of the space M;. 
Find the matrix of - relative to the basis formed by the polynomials 
2 4 2t, SÊ +3 ae Иа 5E E: 
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14. Compute the eigenvectors and the eigenvalues of the operators defined 
by the matrices 


4 -1 -2 
2 0 
о (0 2). ®(2 1-2 


15. Let an operator define the rotation of a plane through the angle m. 
Find the operator adjoint to the given one. 2 
16. Convert the quadratic form 2x? + 5y? + 2z? — 4xy — 2xz + 4yz to the 
diagonal form. 

17. Specify what surfaces are given by the equations 

(a) 7х2 + 6y? + 522 — 4ху — 4yz — 6x — 24у — 18z + 30 = 0; 

(b) x? + 5y? + 22 + Ixy + 6xz + 2yz – 2x + 6y + 22 = 0; 

(с) 5х2 — у? + z? + 4ху + 6xz + 2x + 4y + 62 – 8 = 0. 


Answers 


1. The collection of polynomials of order not exceeding 2. 2. (a) yes; (b) yes. 

4. For example, (0, 1, 0, 0), (0, 0, 1, 0). 5. (1, 1, 1). 6. 4; xi, х, Xs, X4. 7. 7/4. 8. $, 
2 2 2 2 1 2 1 2 PE E 

"373 » 3" 37 Е 5 3° q^ 3 .9.y = (0, 3, 5, 2), 2 (2, 2, 2, 2). 

11. The basis of the image is y; = (2, 1, 1), y2 = (- 1, 2, 1). The basis of the kernel 15 z = 


1 0 0 

1 1 
(1, 1, 1). The rank is 2. The nullity is 1. 12. ( | 13.| -15/4 -4 —5 
Tod 94 3 4 


one 


21 1 
14. (23029 = 2; ( Мм = 1, X = 2, № = 3 and (9. (?): 1 |, [0 1, 
1 1 


15. The adjoint operator defines the rotation through the angle — 5: 16. Х? + ТҮ? + Z5 
хэ Ба даг ee a ee Fee eee Gave a „1. у 
v2 уб, RE 2 у6 v3 v2 , V6 5 v3 

NOTE: PANE гор : Х Ү 2 : 
17. (a) ellipsoid 78 * 1 + 2/5 1; (b) hyperboloid of one sheet 1/3 + 1/6 1712 1; 
х ү? 


атма — 2/4 — 

















27. 





(с) hyperbolic paraboloid 





Chapter 7 


An Introduction to Analysis 


7.1 Basic Concepts 


Sets. Set is a basic undefined concept in mathematics. We shall be 
content with the understanding that a set is a group or a collection of well- 
defined distinguishable objects which are thought of as a whole. It may 
be a set of letters printed on this page, a set of grains of sand on the 
seashore, a set of all roots of an equation or a set of all even numbers. 
Each object in a set is called an element or a member of the set. To signify 
that an element a is contained in a set A we write a € A. The notation 
a ( A means that a does not belong to A. 

Let A and B be two sets. If every element in A is also contained in 
B we say that A is a subset of B and write A C B. For instance, if Z is 
a set of all whole numbers and Z’ is a set of all even numbers then Z' C Z. 
Notice that always A C A. 

If A C B and BC A, ie, if every element in A is also contained in 
B and vice versa, we say that A and B are equal and write A = B. This 
means that a set is uniquely defined by its elements. So we may define 
a set by listing all the elements of the set enclosed in braces. The sets 


A = (а), А = (a, b}, А = (a, b, cJ 


consist of just one element a, two elements a and b, and three elements 
a, b and c, respectively. Sometimes it is impossible or impractical to list 
all elements of the set. In this case three dots are used to represent unlisted 
elements, e.g., 


A= fa, b,c, ...] 


is a set that consists of a, b, c and some other elements. To define the 
unlisted elements we shall use a written description that must fit all ele- 
ments of the set and only elements of the set. For example, we shall write 

the set of natural numbers {1, 2, 3, ...], 

the set of squares of natural numbers {1, 4, 9, ...}, 

the set of primes (2, 3, 5, 7, ...}. 

If A C B and A z B, A is called a proper subset of B. 


230 7. An Introduction to Analysis 


Sometimes we do not know in advance whether a set contains at least 
a single element or not. So it is helpful to introduce the notion of an empty 
set, і.е., a set with no elements? We shall denote the empty set by Ø. The 
empty set is a subset of any set or any set contains the empty set as its 
subset. 

Operations on sets. Let A and B be two sets. The union of two sets 
A and B is the set C = AUB of all elements contained either in A or in 
B, or in both. The intersection of two sets A and B is the set C= ANB 
of all the elements contained both in A and in B. For example, let A = 
(1, 2, 3} and В = (2, 3, 4, 5}. Then AUB = (1, 2, 3, 4, 5) and ANB = 
(2, 3]. 

If ANB = Ø, A and B are said to be disjoint sets. 

We can similarly define the union and intersection of any number of 
sets. 

Finite and infinite sets. A set is said to be finite if it has a finite number 
of elements. A set of all residents of a specific town and a set of people 
living on Earth are examples of finite sets. 

A set is said to be infinite if it is not finite. The set N = (1, 2, ...} 
of all natural numbers is infinite. 

Let A and B be two sets. We say that a one-to-one correspondence is 
set up between A and B if each element of A is associated with an element 
of B so that (i) distinct elements of A are associated with distinct elements 
of B and (ii) each element of B is put into correspondence with an element 
of A. It is easy to see that if a one-to-one correspondence is set up between 
the sets A and B these sets are called equivalent. We write А — В to mean 
that A and B are equivalent sets. 

An infinite set is said to be countable if it can be put into a one-to-one 
correspondence with the set IN of natural numbers, i.e., if the set is equiva- 
lent to №. Any infinite set contains a countable subset. 

It can be known that the set of all rational numbers is countable while 
the set of all real numbers is uncountable. 


Real numbers. The numbers 1, 2, 3, ... are called natural numbers. 
Every number which can be expressed as a fraction of the form + Е , Where 


m and п are natural numbers, and zero are rational numbers. Thus every 
positive integer and every negative integer are rational numbers. All rational 
numbers can be expressed as repeating decimal fractions. Unlike rational 


? It has not yet been established whether a set of natural numbers л such that the equa- 
tion x^ *? + y"*? = 2772 has positive integral solutions is empty or not. (In other words, 
it has not yet been established if Fermat's last theorem is true or false.) . 
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numbers, irrational numbers can be represented by infinite nonrepeating 
decimal fractions. The union of rational and irrational numbers forms a 
set of real numbers. It can be shown that the set of all rational numbers 
is countable while the set of all real numbers is uncountable. By convention 
the sets of natural, whole, rational and real numbers are denoted by N, 
Z, Q and R, respectively. We shall not give formal definitions of basic 
properties and operations on real numbers assuming that these are well 
familiar to the reader from the course of high-school mathematics. 

Absolute values of real numbers. Let a be a real number. The absolute 
value (or modulus) of a is equal to a if a is positive and is equal to — a 
if a is negative. The absolute value of zero is zero. We denote the absolute 
value of a by |a| and write 


[oie a if a20, 
-a if а<0. 
The inequality |х| < a, where a > 0, is equivalent to the relation 
-а $ ха. 
(Show that this relation is true.) 
The basic properties of the absolute values are: 
(1) |a- b] = |a|- |6. 


(2) 5 = -L (b z 0). 


ча Relations (1) and (2) are direct consequences of laws of multiplication 
and division of real numbers and the definition of the absolute value of 
а real number. »* 


(3) Ja + b| < |a| + [5]. 








-* Indeed, it is easy to see that 
-|a| < a < lal, 
-|b| < b < [0]. 

Then adding the inequalities termwise, we obtain the inequality 

– (la| + |b|) <a + b< |a| + |b], 

which is equivalent to the desired relation |a + b| < |а| + |b]. > 
(4) |а - Jbl] < la — bl. 

ч Indeed, the relation 
|a| = Ka — b) + b| < |a – b| + |b| 
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implies that 
la| – |b] < |a — bl. (*) 
Similarly, the relation 
|b| = (b — a) + a| < |b — a| + |a| = |a — b| +[а| 
yields 
la — b| > |b| — |а| 
or 
la| – |b] 2 -la — b]. (жж) 
From (*) and (жж) it follows that 
-l]a - b| < |a| - |b| < la – bl. 
Whence we obtain the desired inequality 
[а - ||| < la — b|. > 


Absolute and relative errors. We shall introduce some notions that are 
widely used whenever we apply numerical methods to compute approximate 
solutions of mathematical problems. 

Let a be a true value of some quantity and a* be an approximation 
to a. We shall call a the exact number and a* the approximate number. 
The simplest measure to estimate the precision of the approximate number 
a* is the absolute error of a*. We say that a positive number A(a*) is the 
absolute error of a* if 


la — a*| < A(a*). (жж) 


This definition of the absolute error is rather ambiguous. For example, 
if both a and a* are known the absolute error of a* is exactly equal to 
the absolute value of the difference between a and a*. However, we may 
not know the value of a. In this case inequality (***) means that the abso- 
lute value of the difference between a and a* does not exceed A(a*) and, 
consequently, any other positive number larger than A(a*) may also be 
regarded as the absolute error of a*. 

The absolute error refers to the precision of approximation and tells 
us nothing about the accuracy. For example, if we have made two measure- 
ments of temperature with the same absolute error equal to 0.2 °C and 
found the readings 1000 °C and 10 °C, both measurements аге to the same 
level of precision. However it is easy to see that the former is more accurate 
than the latter. 

The accuracy of an approximation refers to the relative error. We say 
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that a positive number ó(a*) is the relative error of the approximation a* 

to a if 

a—a* 
a* 


< ó(a*) (a* = 0). 








The relative error is usually expressed as a percentage or a decimal fraction 
of a*. 

To signify that a* is the approximation to a with the absolute error 
A(a*) we write 


= а* x A(a*), 


where a* and A(a*) are expressed as decimal expansions with the same 
number of digits to the right of the decimal points. For example, the rela- 
tion a = 5.272 + 0.003 means that 5.272 — 0.003 < a « 5.272 + 0.003. 
Similarly, if a* is the approximation to a with the relative error ó(a*) 
we write 
а = a*(1 + ó(a*)), 


where ó(a*) is expressed as a decimal fraction of a*. 

Significant digits. The first nonzero digit that occurs in the decimal ex- 
pansion of the approximate number a* on the left and all subsequent digits 
to the right of this digit are called the significant digits. For example, in 
аў = 0.2015, af = 23.653, аў = 0.00201500 the underlined digits are sig- 
nificant. 

A significant digit is said to be accurate if the absolute error A(a*) of 
the approximate number a* does not exceed the number whose decimal 
expansion contains a unity in the decimal position corresponding to this 
significant digit. If in the decimal expansion of a* the last accurate digit 
occupies the kth position, counting from the left, the approximate number 
а* is said to be accurate to k decimal positions. For example, if а* = 0.3745 
and A(a*) = 0.0001 then all nonzero digits in a* are significant. If 
а = 23.6538 and a* = 23.653 then the approximate number is accurate to 
3 decimal positions or to 5 digits, for the absolute error Д(а*) = 0.0008 
is smaller than the number obtained by substituting a unity for the last 
digit on the right and zeros for the other digits in a*, ie. 
A(a*) = 0.0008 « 1 х 1077. 

Sometimes it is helpful to apply a narrow definition of the accurate 
digit, namely, a significant digit is said to be accurate if the absolute error 
A(a*) of a* does not exceed the number whose decimal expansion contains 
half a unity in the decimal position corresponding to this significant digit. 
For example, if a = 14.98 then the approximate number a* = 15.00 is ac- 
curate to L decimal positions or to 4 digits since A(a*) = 0.02 < 
0.5 x 107°. 
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The absolute and relative errors are usually represented by numbers with 
two or three significant digits. 

Number line. Real numbers can be conveniently displayed as points on 
a line (Fig. 7.1). 

Suppose that we have drawn a line and fixed the direction, the origin 
O and the unit distance e for this line. To display a real number a on the 
line we choose a point to the right of O such that the distance between 
this point and the origin O is equal to a if the number a is positive. When 


e 


ТИЕ tun eL 
0 a 


Fig. 7.1 


a is negative we choose a point to the left of the origin O at the distance 
equal to the absolute value |a| of a. Clearly, the number a = 0 corresponds 
to the origin O. Therefore we have set up a one-to-one correspondence be- 
tween the points of the line and the elements of the set of real numbers 
so that each real number is associated with one and onfy one point on 
the line and vice versa. 

The line whose points are put into a one-to-one correspondence with 
elements of the set of real numbers is called the number line. 

We shall denote by x the point on the number line that corresponds 
to the real number x. 

The number line is a mathematical model which is helpful to interpret 
relations between real numbers. The inequality x, < x. means that on the 
number line x; lies to the left of x». The inequality xı < xs < x» means 
that x; lies between х; and x». 


Intervals. The important notions of intervals on the set of all real num- 
bers are widely used in analysis. 

A set of real numbers x is called 

(a) the closed interval [a, b] if the inequality a < x < b holds for every 
x in the set; : 

(b) the open interval (a, b) if the inequality a < x < Б holds for every 
x in the set; 

(c) the half-open or half-closed interval (а, b] if the inequality 
а < х < b holds for all x in the set. (Similarly, the set of real numbers 
x satisfying the inequality a « x « b is also the half-open or half-closed 
interval denoted by [a, b).) 

We shall also consider infinite intervals by introducing points at infinity, 
+ о and — со being the positive and negative infinities, respectively. For 
example 
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(a, +оо) is the set of real numbers such that x > a, 

[G, +оо) is the set of real numbers such that x > a, 

(— œ, b) is the set of real numbers such that x < b, 

(— оо, b] is the set of real numbers such that x < b, 

(— оо, + оо) is the set R of all real numbers. 

Neighbourhoods of points. Let xo be a point on the number line and 
6 > 0 be a real number. 

The interval which contains x is called the neighbourhood of хо. The 
interval (xo — 6, xo + ô) which is symmetric relative to xo is called the ô- 
neighbourhood of xo (Fig. 7.2). Therefore the ó-neighbourhood of хо is the 
set of real numbers that satisfy the inequality |x — xo| < or 
Xo — 6 « x < xo + ô. The interval (xo — ô, xo + 5) which does not contain 
the point xo is called the deleted ó-neighbourhood of x. 


BÓ BAI tied i ih) — — 


Xg-Ó Xp Ito 


Fig. 7.2 


Bounded and unbounded sets. Let E be a set of real numbers. Then 
the set E is called 

(a) bounded above if there exists a number b such that x < b for all 
хєЕ, 

(b) bounded below if there exists a number a such that a « x for all 
ХЄЕЁ, 

(c) bounded if E is bounded above and bounded below, ie, if there 
exist numbers а and b such that 


a«x&b forall хєЁ. 


Therefore the set E is bounded if E is contained in the closed interval [a, 5]. 

For example, the set E = (— œ, 1] is bounded above and the set of all 
natural numbers is bounded below. 

A set which is not bounded above (below) is called unbounded above 
(below). For example, the set of all natural numbers is unbounded above 
and bounded below. The set of all negative numbers is unbounded below 
and bounded above. 

The sets of all integers, of all rational numbers and of all real numbers 
are unbounded. 


Supremum of a set. Let E be a set bounded above, i.e., let there exist 
a number b such that x < b for all x € E. Then b is called an upper bound 
of E. Any number Р” which is larger than b is also an upper bound of Е. 
Definition. The number M is called the supremum of E provided that 
(1) for any x € E there holds x < М, 
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(ii) for any (whatever small) number є > 0 there exists a number x* € E 
such that M — e < x* < M. 
In other words, the supremum of Е is the /east upper bound of E. We 
shall denote the supremum of E by M = supE or M = sup {x}. 
xe 


If E is a set unbounded above we put the supremum of E equal to + oo 
and write sup E = +o, | 

Infimum of a set. Let E be a set bounded below, i.e., let there exist 
a number a such that a « x for all x € E. Then a is called a /ower bound 
of E. Clearly, any number smaller than a is also a lower bound of Е. 

Definition. The number т is called the infimum of E provided that 

0) for any x € E there holds x > т; 

(ii) for any (whatever small) number ғ > 0 there exists a number x* € E 
such that m «€ x* < m + t. 

Thus, the infimum of E is the greatest lower bound of E. We shall 


denote the infimum of E by m = inf E or m = inf {x}. 
хєЁ 


If E is a set unbounded below we put the infimum of E equal to — oo 
and write inf E = — œ. 

To illustrate the notions of supremum and infimum of a set we consider 
the following examples. If E = [a, Б] then inf E = a and sup E = b. If 
E = (a, b) then again inf E = a and sup E = b. Notice that inf E and sup E 
are contained in E in the former example and do not belong to E in the 
latter one. For the set E — D 2) FACE 1 , 2) we have inf Е = 0 and 
sup E = 1. 

In conclusion we state the following theorem. 

Theorem 7.1. A non-empty set of real numbers which is bounded above 
has a supremum and a non-empty set of real numbers which is bounded 
below has an infimum. 

Logical symbols and connectives. To abbreviate the notation and simpli- 
fy the definitions we shall use logical symbols and connectives. 

The universal quantifier v is read “For every”, “For any”, “For each", 
“For all". 

The existential quantifier 3 is read “There is", “There exist", “There 
exists at least one...” or something equivalent. 

We say that a declarative sentence is a statement if it can be true or 
false but not both. For example, the sentences “Mathematics is a science”, 
“The number 2 is smaller than the number 3" and "The number 6 is a 
prime" are statements whereas the sentences "Close the door" or "How 
old are you"? are not. We shall denote the statements by Greek letters a, 


B, y, etc. 
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The implication a 8 (read “if o then 8” or “о implies В”) is a false 
statement if and only if o, called the antecedent, is true and 8, called the 
consequent, is false. We have to distinguish the implication from the cause- 
and-effect relation. Unlike the latter the implication a = 0 is true whenever 
the antecedent o is false. In other words, a false statement implies any state- 
ment, e.g., “if 2 x 2 = 5 then the unidentified flying object has landed near 
your house" 

The equivalence a € В (read “о if and only if 8”) means that the state- 
ments a and 8 are logically equivalent. 

The conjunction a ^ В (read “о and 8”) is a compound statement made 
up of the statements a and 8 connected by the conjunction and. The con- 
junction a ^ 8 is regarded as a true statement if and only if both a and 
8 are true. 

The disjunction а V В (read “a or 8") is a compound statement made 
up of the statements a and 8 connected by the conjunction or. The disjunc- 
tion is thought of as true if and only if at least one of the statements is true. 

Let o be a statement. The statement a (read “not o») is called the nega- 
tion of a, a being true if о is false and vice versa. 

To negate a statement that involves the quantifiers we have to substitute 
every universal quantifier for the existentional quantifier and vice versa and 
replace the antecedent by its logical opposite so that if 8 = у then 
В= у ө Влт. 

Necessary and sufficient conditions. Let the theorem “If the statement 
о is true so is the statement 8” be true. The statements а and 8 that can 
be compound statements are called the hypothesis and conclusion, respec- 
tively. The theorem can be symbolized as the implication a = 8 and can 
also be expressed as 

a is a sufficient condition for 8 
or 
B is a necessary condition for a. 

Now we shall find out what we mean when speaking of the necessary 
and sufficient conditions. 

Let В be a statement. We say that a statement о is a sufficient condition 
for 8 if œ implies В and а is a necessary condition for 8 if a follows from £. 

Let a and 8 be two statements given as follows 

a: “The number x is equal to zero", 


8: “The product xy is equal to zero". 
Then a = 8. 


ча Indeed, for xy to be equal to zero it is sufficient that x be equal to 
zero. For x to be equal to zero it is necessary that xy be equal to zero. 
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But 8 is not a sufficient condition for a since x can be distinct from zero 
when xy is equal to zero. ® 

If о and 8 are the statements each of which implies the other, i.e., a = 8 
and 8 = a, we say that each of œ and 6 is the necessary and sufficient 
condition for the other and write 

а e.b. 

The following expressions all mean that о is the necessary and sufficient 
condition for 8 and vice versa: 

(a) for o to be true it is necessary and sufficient that 8 hold; 

(b) а holds if and only if 8 is satisfied; 

(с) а is true if and only if В is true. 

Mathematical induction. It is not a rare case when a statement which 
is true in some particular instances turns out to be false in general. For 
example, if we compute the values of 9915? + 1 for the subsequent natural 
numbers 1, 2, 3, ..., 1079 we shall fail to get at least one value which is 
equal to the square of a natural number. Based upon this experience we 
might conjecture that the expression 9915? + 1 will never produce squares 
of natural numbers when л is natural. However this conclusion would be 
false. The point is that the smallest л such that the value of 991n? + 1 
becomes equal to the square of a natural number is extremely large, viz., 


n = 12055735790331359447442538767. 


Against this background it seems reasonable to draw our attention to the 
following problem. Let there be a statement which is true in some particular 
cases. How can we prove that it is true in general without having to verify 
it for each particular case that would be an impossible task? 

An important tool which enables us to answer the imposed question 
is the mathematical (complete) induction method based on the principle 
of mathematical induction. 

Principle of mathematical induction. Let a be a statement that is true 
for certain л. Then а is true for all natural п provided that 

(a) о is true for n = 1; 

(b) if о is true for n = k then a is true for n =k + 1. 

This principle lays down the basis of mathematical reasoning. 

To illustrate how the principle of mathematical induction works we shall 
prove Bernoulli's inequality: 

If h > —1, then 

(1+ А)" >21 + лпй wnen. (*) 

Clearly, inequality (+) is true for n = 1. Assume that (*) has been proved 
forn=m> 1, te, 

(1 + A)" 2 1 + mh. 
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Multiplying both sides of this inequality by (1 + Л) > 0, we have 
(1+ h)"*! » (1 + mh) + А) = 1+ (mt Dh + mk. 
Deleting the nonnegative number mh? from the right-hand side, we obtain 
(1 + д)" 2 1 9 (m + DA. 


Thus (8) is true for n = m + 1. Hence, Bernoulli's inequality (8) is true 
for all n € N. 


7.2 Sequences of Numbers 


Notion and notation. Let every natural number n be associated with 
а real number a, and let a rule that puts n into correspondence with an 
be known. Then we say that a sequence of numbers 


а, 0, ..., аһ, ... 


is defined on the set of natural numbers. 

The numbers ai, бо, ..., аһ are called the terms of the sequence. We 
call а, the nth term of the sequence; a, is usually given by а law which 
enables us to compute any other term of the sequence. 

To abbreviate the notation we shall denote the sequence a, a, ..., 
йл, ... by writing (a,}”. 

Examples of sequences are 





Lp d d 1 . 

n аР LN 

(271 2.2.48... 2. см 

{1} =1,1,1,...,1,...; 

(-1y*! 1 1 (-1)"*! 
pua ө a АШ gosse ed 


(cos n] = cos 1, cos2, cos3, ..., COSR, ... . 


Limit of a sequence. A number A is said to be the /imit of a sequence 
{аһ} if, given any positive number e, there exists a number N such that 
for all и> N 


lan — A| < е. 


? We have to distinguish the sequence fan) from the set (a,). For example, (5| = 
5, 5, ..., 5, ... is a sequence whereas the set (5) contains a single element, 5. 
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We shall write 


А = lima, or а, — A 
nd no 
to mean that A is the limit of the sequence {a,}. 
Using logical symbols we can write the definition of a limit as 


(lim a, = А) ө ме> 0 IN миь №» |а, – A| < e. (7.1) 
no 


The notion of a limit is easy to interpret by displaying the terms of 
the sequence (an) on the number line (Fig. 7.3). The inequality 


а, A-E ам; A Ino} A*& In а, 
Fig. 7.3 


|а — A| < € is equivalent to A — € < а, < A + е which implies that a, 
lies in the -neighbourhood of A. Hence, A is the limit of {an} if, given 
any e-neighbourhood of A, there exists a number N such that all a, with 
n > Nare contained in this e-neighbourhood of A, i.e., in the open interval 
(A — £, A + £). Thus, only the finite number of terms a1, а, ..., ам can 
be outside the open interval (A — е, A + є). Whence it follows that the 
sequence with all terms equal to A, called the stationary sequence, has the 
limit equal to A. 

A sequence is said to converge if it has a finite limit and to diverge 
otherwise. 








Example. Consider the sequence ( 2 2 І |. Qn = Л : | being the nth 


n+l cd 





s|- 


term. Clearly, the larger n the closer to 1 is the fraction 


This prompts us to assume that 
ntl — 1 





lim а, = lim 


now noo 


ча To prove that our assumption is true we shall take an arbitrary ғ > 0 
and show that there exists a number № such that for all и> № 


nb. <e. (7.2) 
n n 





la. — 1| = 





From the inequality l < є we have n > . Then for any natural N 
exceeding 1 and all n > N there holds l < є, ie., (7.2) is true. Hence, 


by virtue of the definiton of a limit we have lim aq, = 1l. » 


n>w 
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In general the number N is not independent of є, ie, N = N (e). For 
instance, referring to the previous example, we observe that we can choose 
М equal to 10 or to any other number exceeding 10 when є = 0.1. However, 
if е = 0.01 N has to be larger than 100. 

Remark. The number N is not uniquely defined by the value of ғ in 
the sense that if inequality (7.1) holds for all n > № it also holds for all 
п> №, where № > №. Then to prove that lim a, = A it suffices to 


no 
choose any number N such that |an — A| < e for all n > N. Therefore we 
do not need to find the smallest number N satisfying (7.1). 
How we shall prove two important theorems on limits of sequences. 
Theorem 7.2 (Cauchy convergence criterion). For the sequence ai, a2, 
‚„..› Qn... to converge it is necessary and sufficient that for any € > 0 
there exists N such that for all n> N and all т> № 


lan — am| < 6. 


A sequence (л | satisfying Theorem 7.2 is said to be a Cauchy sequence. 
Theorem 7.3 (on uniqueness of a limit). A sequence (a,) can not have 
two distinct limits. 
*« Let A be a limit of {an} and let B # A. To prove that B is not a limit 
of {a,} we take є so small that the e-neighbourhood of A and the e- 
neighbourhood of B do not intersect. For the purpose it suffices to set 


- сше (Fig. 7.4). 
а —— НАА = 
А B 
Fig. 7.4 


Since lim a, = A, only a finite number of terms of [а, | can be outside 


nea 

the open interval (A — є, A + £). Hence, the open interval (В — є, В + є) 
contains at best a finite number of terms of {an} and B is not a limit of 
(a). > | 

Bounded sequences. A sequence {an} is called 

(a) bounded above if there exists a number M such that a, < M for all n; 

(b) bounded below if there exists a number m such that a, > т for all n; 

(c) bounded if {an} is bounded both above and below, i.e., there exist 
numbers т and M such that m < a, < M for all n. 

Notice that all terms of a bounded sequence are contained in the closed 
interval [m, M] of the number line. 

It is easy to observe that the sequence ..., —n, ..., – (п – 1) ..., 
—3, —2, —1 is bounded above, for all its terms are negative, whereas the 
sequence 1, 2, 3, ..., n, ... is bounded below since all its terms are not 


16—9505 
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+1 
smaller than 1. The sequence {an}, where a, = п 





= 1+! isthenth 
п 


term, is bounded since 1 < a, < 2 for all n. 

Sometimes it is helpful to use a slightly different definition of a bound- 
ed sequence, namely, a sequence {a} is called bounded if there exists a 
number K > 0 such that 


|а| < К vn. 
By using logical symbols we can write this definiton as 
({a,} is bounded) e ЗК >0 vn Ja, < К. 


A definition of an unbounded sequence is easily obtained from the defi- 
nition of a bounded sequence by interchanging the quantifiers and convert- 
ing the inequality involved, i.e., 


({a,} is unbounded) e УК > 0 an |a,| > К. 


Example. Prove that the sequence (27| is unbounded. 
ч Evidently, for апу К> 0 there exists n such that 2" > К, ie, 
n > log? К. Hence, the sequence (2"] is unbounded. » 

A sequence is said to diverge to œ, and we write lim a, = оо, if, given 


nw 


any, whatever large, M > 0 there exists a number N = N(M) such that 
lan] > M vn» М. 
A sequence {a,} such that 
VM>0 ЗМ wn>N а> М (а < -M) 


is said to diverge to + оо (— оо). In this case we write lim a, = + oo 
(lim a, = — oo). ad 


nouo 

We shall say that a sequence which diverges to œ, + oo or — оо is infinite- 
ly large. It has no finite limit satisfying the definition of a limit given above 
since the symbols со, + oo and — œ do not represent real numbers. In what 
follows we shall mean a finite limit as a real number. It is also worth men- 
tioning here that there exist unbounded sequences which are not infinitely 


large. For example, the sequence {п sin F} is unbounded but does not 


diverge to infinity. 
Theorem 7.4. Every convergent sequence is bounded, ie, there exist 
numbers m and M such that for all n 
т< d, < М. 


* Let lim a, = A and let є > 0 be an arbitrary number. Then there exists 


now 
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N such that the open interval (A — £, А + £) contains all the terms a, with 
п> Мапа a, @, ..., ам are the only terms of the sequence which can 
lie outside this interval (Fig. 7.5). Thus only a finite number of terms can 
lie outside the interval and we can choose the smallest d and the largest 
& of them. Now let m = min (4, A — €} be the smallest of the numbers 
ã and A — ғ and let M = max (à, A + €} be the largest of the numbers 
à and A + e. Then the closed interval 1т, M] contains the terms ал, Ф, 

.., ам and the open interval (A — e, A + e). Since all the terms ал with 
n > М + 1 belong to (A — e, A + e) the closed interval [m, M] contains 
all the terms of the sequence {an}. Hence, (2,] is a bounded sequence. В» 


а! ay A-E One A ань? А+& а, 


Fig. 7.5 


Theorem 7.4 implies that for a sequence to converge it is necessary that 
this sequence be bounded. However, this is not sufficient for a sequence 
to converge, as is easily seen from the following example. 

Example. The sequence 


1, 0, 1, 0, 1, ... (*) 


is bounded but diverges. 
ча To prove that the sequence diverges we suppose the converse, i.e., the 
sequence (ж) has a limit, say, equal to A. Then for any ғ > 0 say fore = І E 
there exists N such that 


|а — A| «i vn» М. 
Since the terms of (*) alternate between 0 and 1 there must hold 
[0 — A] = JA] «i and і -Ap «i vn» М. 


Whence, we have 
1 1 1 
1= | -—A)+ 4|< |1 - Al + 14| «3 tI T7’ 
that is, 1 < 1 , which is impossible. This means that our assumption on 


convergence of (») is false. Hence, the sequence (*) does not have a limit, 
ie, it diverges. 9 

Operations on convergent sequences. We shall discuss a number of theo- 
rems which define arithmetic operations on convergent sequences and 
basic properties of limits of sequences. 


16* 


244 7. An Introduction to Analysis 





Theorem 7.5. Let {an} and {bn} be sequences which converge to A and 
B, respectively. Then (аһ + bn} is a convergent sequence and 
lim (a, + bj) = A + B= lim a, + lim by. 


п о nw 


* Let є 0 be an arbitrary number. Since lim a, = A, there exists Ni 


now 


such that 
|а – А1 <2 vn» M. (8) 


Similarly, since lim 5, = B, there exists Nz such that 


no 
|b. — В| <5 vn > №. (жж») 


Now let N be the largest of N; and N2. Then for any n > N there hold 
both (+*+) and (ээх), Hence, we have 


Кал + bn) — (A + B)| = (а, – A) + (bn – B)| 


< |an — A| + |bn - В| «5 кё. 


2 €. 


Thus 
v£ 20 IN vn» Мэ» (аһ + b.) - (А + B)| < е. 


By virtue of the definition of a limit we infer that A + B is the limit 
of the sequence (а, + b,]. > 
Theorem 7.5 can easily be generalized to any finite number of conver- 
gent sequences. 
Analogously we can prove 
Theorem 7.6. If {an} and {bn} are convergent sequences so is (ay — bn} 
and 
lim (a, — bn) = lim a, — lim bn. 
nwo nov no 
Also of importance are 
Theorem 7.7. If {an} and {bn} are convergent sequences so is (as: bn} 
and 
lim (@,- bn) = lim ал: lim by. 


n- o п— о no 


Theorem 7.8. If {an} and (bs) are convergent sequences and if b, = 0 


noo b, 


lim а, 


lim Озге 
пә On lim br ` 


no 


for all n and lim b, #0 then x] is a convergent sequence and 
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Monotone sequences. A sequence {an} is called 

(a) nondecreasing if а < 2 < ... <а„<а„+\<...; 

(b) nonincreasing if ау 20: >... > ал Z а+1 7... 

(c) monotone if {an} is either nondecreasing or nonincreasing. 

A nondecreasing sequence (a,] is bounded if it is bounded above, i.e., 
if there exists a number M such that a, < M Ул, for all the terms of {an} 
are contained in the closed interval (ат, M]. 

A nonincreasing sequence {a,} is bounded if it is bounded below, i.e., 
if there exists a number m such that а, > m vn, for all the terms of {an} 
are contained in the closed interval [m, a]. 

Theorem 7.9. Every monotone and bounded sequence has a limit. 
ча Since a sequence {dan} is bounded the terms of {an} form a set which 
has a supremum and an infimum. Let M be a supremum of this set and 
let us show that lim а, = M provided that (a,] is a nondecreasing se- 
quence. s 

By definition of a supremum for any = > 0 there exists aw such that 
an > M — e and ay < М. Whence, we have 0 < M — an < є. Since {a,} 
is a nondecreasing sequence we have 


0<M-a<M-an уп>М. 
Then by virtue of M — an < є, we get 


0O<M-a<éewn>N 
or, equivalently, 
lan - М| <= vn» М. 


Whence it follows that M is a limit of {an}. 
Analogously we prove that lim a, = т provided that {an} is a nonin- 


п с 
creasing bounded sequence and m is an infimum of the set of the terms 
of {an}. > 
Remark. For a sequence to converge it is not necessary that this se- 


( ЕБ 1)” +1 
quence be monotone. For example, the sequence [-2—| is not mono- 


tone but converges to 0, ie, lim a, = 0. 
п— с 
From Theorem 7.9 follows the nested closed interval theorem sometimes 
called Cantor lemma. 
Cantor lemma. Let there be given a sequence of the nested closed in- 
tervals 


On = [G, b] (п = 1, 2, ...), 


such that on41 С on (п = 1, 2, ...) and d. = b, — а, > 0 as n > œ. Then 
there exists only one point which belongs to all on. 
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Cantor lemma refers to the remarkable property of a set of real num- 


bers, 16., the completeness of a number line, which implies that real num- 
bers fill the number line without leaving "holes" in it. 


Number e and natural logarithms. Let 


аһ = ( + i) (*) 


be the nth term of sequence {an}. Then substituting 1, 2 and 3 for n in 
(ж), we get 


Ё _ TFs il = ОЬ В. 
a = 2, a= (1+5) =2+2, a= (1+4) =2+5 + 57 


Thus a; < @ < аз. 
Using the binomial theorem*, we can easily show that the sequence 


n 
(( + 1) } is a bounded and monotonically increasing sequence and 


duas (1+) «3 мп. 


This means that the sequence has a limit which is customarily denoted as 


1 n 
е = lim (: +t) . 
no n 


The number e is an irrational number and it can only be approximated: 
e = 2.7183... . Sometimes, in working with complicated expressions, it is 
convenient to use the number e as the base of logarithms. The logarithm 


of the number x > 0 to the base e is called the natural logarithm and denot- 
ed by In x. 


? The binomial theorem is given by the formula 
n " ан 
(a + by" = а" + -y a'b + T D quip 
= Dian: - 
FOND £O Do m Er gre yt +... +8", 


where! 21x 2x 3x ... Xk. 
Substituting 2 and 3 for n in the binomial formula, we obtain the formulas of abridged 
multiplication representing the square of the sum of two real numbers 


(a+ b? = +7 ab + 2X) pr gts rab + 0? 





and the cube of the sum of two real numbers 


3 2 3x2 2 3x2x1 ,3 
аж 2! ab 3! b 


= а? + 3a?b + 3ab? + b’, 


(а + by = аў + 





which are familiar to the reader from the course of high school algebra. 
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Remark. To prove that the sequence 1+ 1 has a limit it is help- 


ful to apply Bernoulli's inequality. Indeed, by virtue of this inequality we 
have 


m= (1+1) 214152 va: 
n n 


that implies that the sequence {a,] is bounded below. 
Consider a sequence (b,} whose nth term bn is given by 


ntl nl 
ь=-(1+)®- (1*1) (enn. 
n n n 


Evidently, b, — (1 + 3) a, > 2 Vn. Then we have 


Ви = n+1\"*! n+2\"%t2 Е (n + po? 
bn+1 E ( n ) | n + Г) n^ * ! (n " 2y*? 
- (n + 1y^**.gn 
(n + D([G + 1) — Ши + D + 11772 


_ п (п + pt n (n + 1)? п+2 
n+l [(п+1)5—1]'*?° © nti | (n+l 


n 1 1 n+2 
= + : 
| eel 


Applying Bernoulli's inequality to the relation 


1 n+2 
[1+ a] 


we obtain 


Dn > n (1+ 012 )-sn i) 
биз ntl (п + 1) – 1 п +1 п 


ie, Dn > brat. 
Thus, the sequence (5,] is nonincreasing and bounded below. This 
means that {bn} has a limit and so does the sequence {an} and 
































lim 2, — lim *— = lim Dn. 


pee ENS 141 n+ 





7.3 Functions of One Variable and Limits 


Notion of a function. Similarly to the concept of a set a function 
is a primary basic notion. It can be used in analysis in slightly different 
ways. We shall confine ourselves to the following notion of a function. 
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Let X be a set of real numbers x and let there be given a certain law 
or rule which assigns a real number y to every number x in X. Then we 
say that there is a function defined on X and write 


у= Дх) or у= у(х), xeX?. 


The set X is called the domain of a function апа the set Y of values у 
specified by the function is called the range of a function. The domain 
is sometimes denoted by D(f) and the range by E(f) provided that the 
range of a function contains values y for every x in X. 

Sometimes a function is denoted by writing f instead of f(x). 

A function is fully determined if there are given (i) its domain of defini- 
tion X and (ii) a rule which associates every x in X with a certain value 
y = f(x). 

The functions f and g are said to be equal if D(f) = D(g) and the identi- 
ty f(x) = g(x) remains true for all x in D(f) = D(g). For example, the func- 
tions y = x^, -œ < x < +œ, and y = x?, 0 < x < 1, are not equal since 
their domains are distinct; these functions are equal on the closed interval 
[0, 1]. 

Examples of functions. (а) A sequence (а, } is a function of an integral 
variable whose domain is the set of natural numbers, such that f(n) = an 
(1-1, 2, ...). 

(b) y = n! is defined on the set of natural numbers. The relation n! 
(read n factorial) is equal to the product of all integers from 1 through 
n, ie, 


п! =1х2х3х... хи 


and by convention 0! = 1. 

Sometimes the symbol !! is used. The relation (2)!! is assumed to be 
equal to the product of all even integers from 2 through 2n. For example, 
8!! = 2х 4 х 6 х 8. The relation (2n — 1)!! is equal to the product of all 
odd integers from 1 through 2n — 1, eg., 7! 21x 3x 5 x 7. 


1 fo x>0 
(с) у = sgnx = 0 for x=0, 
-1 for x<0 


is defined at each point of the number line — œ < х < +o, The domain 


? This notion describes a numeric function and can easily be generalized to the case 
of arbitrary sets. Let M and N be two arbitrary sets. We say that there is a function f which 
is defined on M and assumes values in N if every element in M is associated with one and 
only one element in N. Thus the function f maps the set M on the set N and is sometimes 
called the mapping. Notice that the extended notion of a function is fully applicable to the 
case of sets of numbers. : 
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of y = sgn x is the set containing only numbers —1, 0 and 1 (Fig. 7.6). 
The abbreviation sgn x means signum function. 

(d) у = [x], where x is a real number and [x] is the largest integer not 
exceeding x, ie, [х] = n forn<x<n+1,n=0, +1, +2, .... This 
function is defined at every point of the number line and its range is the 
set of all integers (Fig. 7.7). 
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Fig. 7.7 


Representations of functions. A function can be expressed by a formula, 
a graph or a table. Respectively, we shall speak of the analytic, graphic 
or tabular representations of functions. 

(a) Analytic representation. ^ function y — f(x) is said to be expressed 
analytically if it is defined by a formula or an equation which specifies 
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to what operations each хє D(f) must be subjected to obtain the cor- 
responding value y of the function. For instance, the function y — dox 
(— со <x < +оо) is given analytically. In this case the domain of a func- 
tion, unless stated otherwise, is thought of as a set of real values x for 
which the formula defining the function assumes only real and finite values.” 
In this sense the domain of a function is sometimes called the domain of 
existence of a function. For example, the domain of y — VI — х2 is the 
closed interval —1 < x < 1 and its range is the closed interval 0 < y < 1. 
The domain of у = sinx is the infinite interval (the number line) 
— ео <x < +оо and the range is the closed interval —1 < y « 1. 
Notice that not all formulas define functions. For instance, the formula 


y-Nl-x! «Nx! - 4 j 


does not specify a function since for any real x at least one of the square 
roots does not assume a real value. 


У 










М(хо, (сс) 


f(x;) 








Xo 





Fig. 7.8 Fig. 7.9 


An analytic representation does not mean that a function is always 
specified by a single formula. A function can be given by different relations 
for different parts of its domain. For example, the function y — f(x) shown 
in Fig. 7.8 can be specified as 


0 forx«O, 
x forO<x<l, 


Дх) = 
2-xforl«x«2, 
0 forx»2. 


(b) Graphic representation. The graph of a function y — f(x) is a locus 
of points (x, f(x)) on the xy-plane whose abscissas are the values of x and 
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ordinates are the values of y (Fig. 7.9). We say that a function admits а 
graphic represeritation when its graph is specified. 

Notice that not all functions can be given by graphs. For example, the 
Dirichlet function 


1 for rational х, 
O for irrational x 


D(x) = { 


does not admit a graphic representation. The domain of this function is 
the number line and the range comprises two numbers 0 and 1. 

(c) Tabular representation. A function is said to be specified in a tabular 
form if it is represented by means of a table which contains numerical 
values of the variable x and the corresponding values of y. 

Limit of a function at a point. A limit is a basic concept of mathemati- 
cal analysis. 

Cauchy criterion for limits. Let f(x) be a function defined in a neigh- 
bourhood Q of a point хо, except probably at хо. Then a number А is a 
limit of f(x) at xo if, given any (whatever small) number ё > 0, there exists 
a number 6 > 0 such that 


f(x) — Al < є, 


whenever |x — xo| < 6 and x z хо. 
To signify that the number A is a limit of f(x) at the point xo we write 


lim f(x) = A 


X>Xo 


Using logical symbols we can express this criterion as 


(im ло) = 4) «veo 3020 vx, хя X», 


Ix — x| < 6 = |Д) — Al < е. 
Example. Let f(x) = 2x + 3 and х = 1. Verify that lim f(x) = 5. 
x71 


ч Clearly, f(x) is defined everywhere including the point xo = 1 where 
Қ) 25. Let є» 0 be an arbitrary number. For the inequality 
|(2х + 3) — 5| < e to hold it is necessary that 


Ix - 2] «e 2x- | «e x - I] <5. 


Thus, if 8-29 we have 0 < |x - 1| «8-3 and [f(x) — 5| < е. 
Whence it follows that the number 5 is the limit of f(x) = 2x + 3 at the 
point x = 1. > 

Geometric interpretation of a limit at a point. Let f(x) be given by a 
graph so that the values of f(x) are equal to the ordinates of the curve 
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M,M for x< хо and to the ordinates of the curve MM, for x > xo 
(Fig. 7.10). And let the value f(xo) be equal to the ordinate of the point 
N. In other words, we assume that the graph of f(x) is obtained from the 
“good” curve МММ» by replacing N for M. 

Let us show that at xo the function f(x) has a limit equal to the ordinate 
A of M. 









Fig. 7.10 


«1 We choose any (whatever small) ғ > 0 and fix the points with the or- 
dinates А, А — £, A + e on the y-axis. Let P and О be the points of inter- 
section of the graph of у = f(x) with the lines у = A — e and y= A +€ 
and let xo — hı and xo + Ло, where hı > 0 and h2 > 0, be the abscissas of 
P and О, respectively. We easily see for any x # xo in the interval (xo — hy, 
Xo + h) the value of f(x) lies between A — & and А + e, i.e., for all x z xo 
we have 


A-e<f(x)<Ate 


whenever xo — A < x < xo + Ё. 

Let ё = min (hı, h2} be the smallest of hı and h2. Then the interval 
(Xo — ô, xo + ô) is contained in the interval (xo — hı, xo + h2). Hence, the 
inequality A — € < f(x) < A + e or, equivalently, 

f(x) - А[ < ғ 
remains true for all х # хо belonging to the interval (хо — ô, xo + ô), i.e., 
for all x satisfying the condition 

0 < |x — %| < ô. 


Whence it follows that lim f(x) = A. » 


X> Xo 
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Thus we infer that the function y = f(x) has a limit A at a point xo 
provided that for any (whatever narrow) strip between the lines y = A — € 
and y = A + є there exists 6 > 0 such that the graph of y = f(x) lies inside 
this strip for all x belonging to the deleted 6-neighourhood of x. 

Remarks. (1) In general the value of 6 depends on the value of є, i.e., 

= 6(8). 

(2) When we deal with a limit of a function at a point xo we exclude 
Xo from consideration. Thus a limit of f at xo is independent of the value 
of f at хо. Furthermore, a function can even be undefined at xo. So any 
two functions equal to each other in a neighbourhood of хо, except pro- 
bably at the point xo, where they can differ or even can both be undefined, 
have the same limit as x > xo or have no limits at all. In particular, this 
implies that when computing the limit of a fraction at хо we may reduce 
this fraction by like factors vanishing at x = xs. 


Examples. (1) Compute lim 2, 
x70 X 


ча Notice that f(x) = x = 1 for all x z 0 and is undefined at х = 0. By 
the definition of a limit the point x = 0 is excluded from consideration. So 


lim * = lim 1=1. > 
x20X x0 


(2) Compute lim f(x) (see Fig. 7.11), where 
x0 


x^ for x #0, 


fœ = Ї for x = 0. 


-« The function g(x) = x?, -œ < x < + ою, is equal to f(x) everywhere, 
except at x = 0, and its limit at x = 0 is equal to zero, ie, lim g(x) = 0 
x20 


(show this!) Hence, lim f(x) = 0. » 
x20 


Problems. Write down the Cauchy criterion for the following limits 
(1) lim f(x) = 5, (2) lim f(x) = 0, 
xl x0 


(3) lim | Дх) = 1, (4 lim f(x) = -2. 
xc - E63 


Sequential criterion for limits. Let f(x) be a function defined in a neigh- 
bourhood Q of хо, except probably at хо. And let (x,], where x, € Q and 
Xn Æ Xo, be a sequence of values of x that converges to хо. Then a number 
A is the limit of f(x) provided that for any [x4] the corresponding sequence 
(f(%n)} of the values of f(x) converges to A. 

This criterion is convenient to apply when we wish to determine whether 
the function f(x) has a limit at xo or not. It suffices to define a sequence 
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(704) which has no limit or two sequences (/(x,)} and (f(x:)] having 
distinct limits. 


ча Consider the function f(x) = sint defined everywhere, except at the 
point x = 0 (Fig. 7.12). 











Fig. 7.11 Fig. 7.12 
We have 
RE RS x=-— + 0, 
AT п-ю and 2 + 2нт "77? 
Јо) = sin nr = 0, (X4) = sin G + inr) = 1. 


Hence we have found two sequences (х, } and (x; | converging to x = 0, 
for which the corresponding sequences of the values of f(x) converge 
to different limits, viz., (f(x«.)] converges to zero while (f(x;)] converges 


to 1. By the sequential criterion this means that f(x) — sin— has no limit 
x 


atx-0. » 

Remark. It can be proved that both criteria for limits at a point are 
equivalent. : 

Limit theorems. We shall derive some basic results which are of impor- 
tance in examining limits of functions of one variable. 

Theorem 7.10. Let f(x) have a limit at a point xo. Then this limit is 
unique. 
ча Let lim f(x) = A. We shall show that there exists no number В # A 


X Xo 
which is a limit of f(x) at xo. 
The statement lim f(x) z B can be written as 


х Xo 


3:20 vó»0 ax, x zx, |х-%|<6^|Дх)— Bl 2 e 
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Using | | - ||| € |a — b|, we have 
ifo) - В| = |V% - A) - (В - A)| > |у) - A| - IB - A|| 
= |8 - Al - Л) - Al]. (+) 


2. 8-4 
шан 


Let > 0. Since lim f(x) = A, then for the chosen e > 0 


х Хо 


there exists 6 > 0 such that 
f(x) - А| <= vx, xx, |x—-»x| « 6. 
Applying (+*+), we reduce the above statement to the following 
f(x) – B| ze vx, xx, [x — %| «6. 
Hence, we have found є > 0 such that for any (whatever small) 5 > 0 


there exists a point x # xo for which 0 < |x — xo| < à and |f(x) — В| 26 
Whence it follows that В z lim f(x) > 
Х- Хо 
A function f(x) is said to be bounded in a neighbourhood of a point 
Xo if there exist numbers M > 0 and ё > 0 such that 
Мо) < М vx€(Qx — 8, % + 8), 


where f(x) is defined. 
Theorem 7.11. Let f(x) have a finite limit at x. Then f(x) is bounded 
in a neighbourhood of xo, ie, ЗМ > 0, 36 > 0 and 


UG) < М мчхЄ(о-6, xo +ô), хя xe. 

а Let lim f(x) = А. Then for апу є > 0, say, € = 1, there exists ô > 0 
Х-? Xo 

such that 
Мо) – A| < 1 


whenever x # xo and |x — xo| < 6. 
Observe that 


F| – |A| < /0)-4| 
is always true. Then we have 
V| < 141 + 1. 


Let M = |A| + 1 if f(x) is undefined at хо and let M = max{|A| + 1, 
[f(xo)|} if f(x) is defined at x. Then 


o| < M 


holds at every point x in the interval (хо — 6, % + 6). By definition this 
means that f(x) is bounded in a neighbourhood of х. » 
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Theorem 7.11 implies that every function having a limit is bounded. 
However, the converse is not always true, ie., a function can be bounded 


in a neighbourhood of хо but have no limit at xo. For example, f(x) = sint 


is bounded in a neighbourhood of x = 0 since 
but f(x) has no limit at x = 0. 
The following two theorems are easy to interpret geometrically. 
Theorem 7.12. Let f(x) < ф(х) for all x in a neighbourhood of xo, except 
probably at xo, and let f(x) and (х) have limits at хо. Then (see Fig. 7.13) 


«lvx,xzo0 





sin 1 
x 





lim f(x) < lim ф(х). 


X 7* Xo 

Notice that in general from the strict inequality f(x) < ф(х) it follows that 
lim f(x) < lim e(x). 
X — Xo х Xo 


For example, let 


2 
foo =x? and ф(х) = i for x z 0, 


1 for x =0. 
Obviously, we have f(x) < ф(х) Vx whereas 


lim f(x) = lim g(x) = 0. 
x70 x20 


| y-*(x) 

| ы=Ч{т) 

І 

! у= f(x) 


1 

| 
| у= f(x) | g-v(x) 
[ 

Ї 


Fig. 7.13 Fig. 7.14 








Theorem 7.13. Let p(x) < f(x) < y(x) be true for all x in a neighbour- 
hood of x, except probably at xo, and let (х) and (x) have the same 
limit A at хо. Then f(x) also has the limit A at хо (Fig. 7.14). 

Limit of a function when the variable tends to infinity. Let f(x) be de- 
fined at every point of the number line or at all x whenever |x| > K for 
some К > 0 so that we can compute the values of f(x) for whatever large 
magnitudes of x. 
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Definition. We say that a number 4 is a limit of f(x) as x tends to 
infinity and write 


lim f(x) = A 


xo 
provided that for any ғ > 0 there exists a number N > 0 such that 
Мо) - A| < е, 


whenever |х| > М. 
Substituting x > N and x < —N for |х| > N, we have A = lim f(x) 


x> +o 


and A = lim f(x), respectively. Whence it follows that 


A= lim f(x) 


x>w 


if and only if both A = lim f(x) and A = lim f(x) hold. 
хэ + Are, 





Fig. 7.15 


Geometrically the statement A = lim f(x) means that, given any 


хә + 
(whatever narrow) strip between the lines y = A — € and y = А + e, there 
exists a line x = N > 0 such that for all x > N > 0 the graph of у = f(x) 
is contained inside this strip (Fig. 7.15). In this case we say that the curve 
y = f(x) asymptotically tends to the line y = A as x ^ + со, 


defined at all points 





Example. Consider the function f(x) = Эн i 
x 


of the number line in which case the nominator is bounded everywhere 
and the denominator increases to infinity as |x| > + оо, This prompts us 
to assume that lim f(x) = 0. 


xo 


17— 9508 
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48 Let us choose є > 0 such that 0 < e < 1. For the inequality 


-0| <E 





1 
х? +1 
to hold there must hold <e. Whence х? + 1 >t and 


x 
x| > Н -1. Thus, if we choose N= 1-1 then 


1 


шоо 0 | < є whenever |х| > N, i.e., A = Oisthe limit of the given 
x 





function as x > œ. 


Notice that the inequality 1 — 1 > 0 is true only for e < 1. Fore > 1 





it immediately follows that r- 1 < є for all x € R. 
x 


The graph of an even function y = uu asymptotically tends to the 
x 


line y = 0 as x 2 о, » | . 
Problems. We leave it to the reader to interpret the following statements 


in terms of inequalities involved in the definition of a limit 
(0 lim f()2-3 Q) lim Дх) = 1; (3 lim f(x) = 0. 
x>w x> +a x> -o 


7.4 Infinitesimals and Infinities 


Notion of an infinitesimal. Let a function a(x) be defined in a 
neighbourhood of a point xo, except probably at xo. Then a(x) is said 
to be an infinitely small function or an infinitesimal as x tends to хо if 
a(x) has at xo a limit equal to zero, ie, 

lim a(x) = 0. 

X 7 Xo 
For instance, the function a(x) = x — 1 shown in Fig. 7.16 is an in- 
finitesimal as x ^ 1 since lim (x — 1) = 0. 


х-1 

In general the function a(x) = x — xo offers an elementary example of 
an infinitesimal as x > xo. 

Applying the Cauchy criterion for limits we can define an infinitesimal 
as follows. 

A function a(x) is an infinitesimal as x > xo if given any ғ > 0 there 
exists 6 > 0 such that 

la(x)| < € 

whenever |x — xo| « 6 and x # xo. 
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Using logical symbols we write 


(a(x) is an infinitesimal as x > xo) e ve 5 0 36250 Vx, x хо, 
|x — xo| < 6 = Ja(x)| < є. 


Analogously, we can define infinitesimals as x > œ, x> +o and 
X — 0, 
We say that a(x) is an infinitesimal as x > оо if lim a(x) = 0. 


xo 


If lim a(x) = бог lim a(x) = 0 the function a(x) is said to be an 


хә to хә -@ 


infinitesimal as x ^ +оо or as x ^ — со, respectively. 





For instance, the function a(x) -1, х #0, is an infinitesimal as. 


. "| : Dye T M 
х o, since lim - = 0. The function a(x) = e^* is an infinitesimal as 
xo 
x> +o, for lim e *=0. 
x> + о 


In what follows we shall confine ourselves to infinitesimals as х tends 
to xo by leaving to the reader a generalization of theorems and concepts 
to the cases when x > o, x ^ +œ and x > — oo. 

Properties of infinitesimals. Now we shall prove some important the- 
orems which enable us to operate on infinitesimals. 

Theorem 7.14. If a(x) and B(x) are infinitesimals as x > xo so is the 
sum a(x) + B(x) as x ^ хо. 


«4 Since a(x) is an infinitesimal as x > Хо then for any € > 0, there exists 
6, > 0 such that 
E 
lex] < 2 , (*) 
whenever |x — xo| < бу and x # xo. 
17” 
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Analogously, for 8(x) we have 62 > 0 such that 
|8(х)| < 2 (жж) 
whenever |x — xo| < & and x # хо. 
Let ô = min (5, 82 }. Then both (*) and (**) hold whenever |x — xo| < à 


and x # хо. Hence 


le) + BWI < lal + [BI <5 + 


E T 
2 


Ух, XX, |x- | <6. 


Thus the sum a(x) + В(х) is an infinitesimal as x > хо. 9» 

Similar theorem can easily be proved for any finite number of in- 
finitesimals as x > хо. 

Theorem 7.15. /f a funciton a(x) is an infinitesimal as x ^ xo and if 
a function f(x) is bounded in a neighbourhood of хо then the product 
a(x)f(x) is an infinitesimal as x хо. 
«4 Since f(x) is bounded іп a neighbourhood of xo there exist numbers 
8, > 0 and M > 0 such that 


р) « M 


for all x € (xo — 6), xo + бу) where f(x) is defined. 
Consider an arbitrary ғ > 0. For an infinitesimal a(x) there exists 6) > 0 
such that, as x > xo, 
E 
M 
whenever |x — xo| < 62 and x # xo. 
Let ё = min (à;, 6&2]. Then both 


le (x)| < 


VO <M and ja(x)| < 3; 
are true whenever |x — ж| < ё and х # xo. 
Hence 


Af] = Ja 09] - F| < -М-є Ух, xXx, [x -x| < 8. 
By definition we infer that the product а(х) (х) is an infinitesimal as 
xx. Р 1 
Example. Consider the function y = x sin = (Fig. 7.17). 
The relation x sin + can be considered as the product of the functions 


a(x) = x and f(x) = sinl . The function a(x) is an infinitesimal as x > 0 


and f(x) = sin 1 is defined everywhere, except at the point x = 0, and is 
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bounded іп апу neighbourhood of х = 0. Then by Theorem 7.15 у = 
х sin 1/x is an infinitesimal as x > 0 so that 

lim x sin ирс 0. 

х-0 х 

Corollary. If a function a(x) is an infinitesimal as x > хо and if a func- 

tion f(x) has a finite limit at xo then the product a(x)/(x) is an infinitesimal 
as X > Xo. 
ча To prove this corollary it suffices to observe that a function having 
a limit at xy is bounded in a neighbourhood of хо. » 





Fig. 7.17 


Lemma. If a function f(x) has a nonzero limit at a point xo then a func- 


tion is bounded in a deleted neighbourhood of x. 


1 
F(x) |A] 
45 Let lim f(x) = A # 0. Then for any £ > 0, say € = EL there exists 
ô > 0 such that for all x 

A 
и -sw <L 
whenever 0 < |x — xo| < ô. 
Since the inequality |A — f(x)| > |A| — [f(x)| is always true we have 
A 
al- [foo] <L. 
Whence 


væl > -4L WX, XX, |x — %|< 6. 
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This means that a function is defined for all x, x z x, 


1 
Ix — x| < 6. and 70) 




















1 ы! E 2 
Дх) fœ] АГ 
Hence the function Җә is bounded in a deleted neighbourhood of 
Xo. P 
Theorem 7.16. Jf a(x) is an infinitesimal as x > xo and if f(x) has a 
nonzero limit at xo then the quotient o is an infinitesimal as x > xo. 
* Express ae) as 
б Дх) 
a(x) 1 
— LL = a(x) ——- 
fo) "9 ә 


By virtue of the above lemma the function XS is bounded in a neigh- 





bourhood of a point хо. Then, applying Theorem 7.15, we infer that the 


is an infinitesimal as x ^ xg. 9 





. 1 
function a(x) 709 
The condition lim f(x) # 0 is essential when we wish to apply Theo- 
хохо 
rem 7.16. For example, let a(x) = x and f(x) = x’. The function a(x) is 
an infinitesimal as x > 0 and the function f(x) has a limit equal to zero 
at x 2 0, ie, f(x) is an infinitesimal as x ^ 0. Evidently, the quotient 
A = 2, = 1 , x Æ 0, is not an infinitesimal as х > 0. In general a quo- 
tient of two infinitesimals is not an infinitesimal. 

Notion of infinity. Let a function f(x) be defined in a neighbourhood 
of a point xo, except probably at xo. Then a function f(x) is said to be 
an infinitely large function or an infinity as x tends to хо if, given any 
(whatever large) number M, there exists a number 6 > 0 such that 





о) > M 
whenever |x — xo| < 5 and х z хо. In this case we write 


lim f(x) = æ. 

X Xo 
We say that f(x) has an infinite limit as x > хо. We also say that a limit 
of f(x) is equal to A with the understanding that A is any (whatever large) 
finite number. 


7.4 Infinitesimals and Infinities 263 


Using logical symbols we can write 
(f(x) is an infinity as x ^ х) e vM »0 36520, Vx, XX 
Ix — %| < 6 >» [f(x)| > M. 
Substituting f(x) > M or f(x) < -M for |f(x)| > M, we have 


lim f(x) = +оо or lim f(x) = — c, 
X 7* Хо X Xo 
respectively. 


For example, the function f(x) = 1 defined at every x ж 0 is an infinity 


as x > 0 (Fig. 7.18). The function f(x) = 23 defined at every x = 0 is an 
x 


infinity tending to + о as x 0 (Fig. 7.19). 











“| 


Fig. 7.19 





Fig. 7.18 


Let us show that f(x) =i is an infinity as x ^ 0. Consider any 


шэн. > M to 
|x] 
hold it is necesary and sufficient that the inequality |x| = |x — 0| < u 


hold. So if we put 6 = 2 then |f(x)| = ar > M whenever |x — 0| = 


and x z 0. Whence we infer that f(x) — 


whatever large M > 0. For the inequality |f(x)| = 





1 
x 





1 
M 
x Q0. 


is an infinity as 


х|— 


k| < 
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To interpret the notion of infinity geometrically look at Fig. 7.20. A 
function f(x) is an infinity as x > xo if given any (whatever wide) horizontal 
strip between the lines y = —M and y = M there exist two vertical lines 
X = Xo — óand x = xo + ósuch that the graph of y = f(x), х 5 xo, lies out- 
side this horizontal strip whenever x is contained in the open interval 
(xo — ô, xo + ô). Notice that not every function f(x) which is unbounded 
in a neighbourhood of x is an infinite function as x — xo. For instance, 


the function f(x) = I sin’ is unbounded in a neighbourhood of x = 0; 


however, f(x) is not an infinity as x > 0. (As an exercise draw a figure to 
interpret this example geometrically.) 











We say that a function f(x) is an infinity as x > œ and write 
lim f(x) = © 


xw 


if, given any (whatever large) number M > 0, there exists a number N > 0 
such that 


ро) > M 


whenever |x| > N. 
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Example. The function f(x) = x is an infinity as x > œ. 
“4 Indeed, YM » 0 IN » 0, say № = М, such that |Дх)| = x| > M 
whenever |х| > N, vx. в» 

Analogously, we can define infinities as x > + оо and as x ^ — х. 

Relations between infinitesimals and infinities. The following theorems 
lay down the relations between infinitesimals and infinities. 

Theorem 7.17. If a function f(x) is an infinity as x > xo then a function 


Te -i dE 
a(x) = ——- is an infinitesimal as x > xo. 
ША, 
ч Let € > 0 be an arbitrary (whatever small), d Since f(x) is an 
infinity as x > хо then for any M > 0, say, M = — , there exists 6 > 0 such 
that 
1 
|А) > M = E 


whenever |x — xo| < 6 and x # xo, in which case there will be defined a 


function a(x) = D and 


091 = рр <и 9 
Thus 


ve > 0 36 > 0, Vx, X х, |х — xo| < 6 = Ja(x)| < =. 





Whence we infer that a(x) = X5 is an infinity as x > xo. » 


Similarly we can prove the following theorem. 

Theorem 7.18. 7f a function a(x) is an infinitesimal as x > xo and if 
a(x) is distinct from zero in the neighbourhood (xo — 6, xo + 5) of xo, ex- 
cept probably at xo, then the function f(x) — zo is an infinity as x > хо. 


Problems. Using inequalities write down the definitions for the limits: 


(1) lim f(x) = œ, (2) lim 70) = 
x1 x- 





(3) lim Дх) = -оо, (4) lim f(x) = 
(5) lim f(x) = +, (6) lim f(x) = -æ 


In conclusion of this section we shall consider the rational function 


ах" + ax”! +... + dm 
= = 0, bo ¥ 0), 
JO ара uus ОТТ? 
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which is a quotient of two polynomials in x of degrees m and n, respective- 
ly. For sufficiently large |x| the denominator is distinct from zero; thus the 
quotient makes sense. Then dividing the nominator and denominator by 
x^, we obtain 


ax" "+ ах" "l+... dax 


JU туч, ga крк" 





It is easily seen that a limit of the denominator as x — со is equal to 
bo # 0. The nominator increases infinitely if т > n, has a limit equal to 
ao if т = n or has a limit equal to zero if m < n. Therefore, the behaviour 
of this function as x > оо is fully determined by the formulas 


ах" + ах” +... + dg do 


= , т=п, 
хэ Рох" + рух"! +... + dy bo 





15 Operations on Limits 


Relationship between a function having a limit and an infinitesimal. 
We shall prove a theorem which will be frequently used below. 
Theorem 7.19. Let a function f(x) be defined in a neighbourhood of 
a point xo, except probably at xo. For f(x) to have a limit A at xs it is 
necessary and sufficient that f(x) admit a representation of the form 


f(x) = A + a(x), 


where a(x) is an infinitesimal as x > x. 
а Necessity. Let f(x) have a limit equal to A at xo, ie, 


lim f(x) = A. 
We put Я 
a(x) = f(x) – A (*) 


or, equivalently, 
f(x) = А + a(x) 
and prove that a(x) is an infinitesimal as x > xo. 
Let £ > 0 be an arbitrary number. Since lim f(x) = A, then for the 


given є > 0 there exists 6 > 0 such that 
x) - Al «e 


whenever |x — xo| < 6 and x # xo. 
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Taking into account (*), we can write 
la(x)| < в 
or 
ve > 0 36 > 0, Vx, x ж, |x — | < 6 = Jo(x)| < e. 


Whence it follows that a(x) is an infinitesimal as x > xo. 
Sufficiency. Let f(x) admit a representation of the form 


f(x) = A + a), (+*) 


where A is a constant quantity and a(x) is an infinitesimal as x > x. We 
shall prove that f(x) has a limit equal to A at xo. 

Suppose that є > O is an arbitrary number. Since a(x) is an infinitesimal 
as X — Xo there exists 6 > 0 such that 


|«ОО| < ё 


whenever |x — xo| < 6 and х # xo. 
But by virtue of (жж) we have a(x) = f(x) — A and |f(x) — A| < e. Thus 
ve > 0, 38 > 0, vx, хя х, x — xo < 6 = |x) - Al < e. 
This means that A = lim f(x). > 


хох, 
Example. Let f(x) = x and x = 2. Then lim f(x) = 2 and 
x22 
Лх) = 2 + (х ~ 2) 

where 2 is a constant quantity, x — 2 is an infinitesimal as x > 2, i.e., 
a(x) = x – 2. 

Arithmetical operations on limits. Let f(x) and (x) be defined in a 
neighbourhood of a point xo, except probably at xo. 

Theorem 7.20. Zf lim f(x) = A and lim ф(х) = B then 

х Хө 


Xx Xo 


(a) lim [f(x) x e(x)] = A + В, 
(b) lim 170) -Ф09| = АВ, 


(с) lim V9 4 provided that lim р(х) = B s 0. 
xx P(X) B X Xo 
<~ We shall prove Statement (b) of this theorem. 


Let lim f(x) = A and lim ф(х) = B. 
By Theorem 7.19 we have 
f(x) = А + о(х), ф(х) = В + BO), 


where a(x) and 8(x) are infinitesimals as x > xo. 
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Whence 
F(x) e(x) = [A + a(x)][B + 8(x)] = АВ + Ba(x) + A8(x) + а(5)809. 


By Theorem 7.15 B a(x), A B(x) and a(x)8(x) are infinitesimals as 
X — Xo. Then their sum is also an infinitesimal as x ^ хо. Thus we have 
represented the function f(x)e(x) as the sum of the constant quantity AB 
and an infinitesimal as x > хо. By Theorem 7.19 we conclude that the func- 
tion f(x)e(x) has a limit equal to AB at хо, ie, 


lim [f - e()] = AB = lim f(x); lim g(x). > 


It is easy to generalize Theorem 7.20 to the case of any finite number 
of factors multiplied. 

Corollary. A constant factor can be taken outside the limit sign. 

We leave to the reader to prove Statements (a) and (c) and to solve the 
following problems. 

Problems (1). Let f(x) have a limit at хо and let р(х) have no limit at 
Xo. Which of the following limits exists? 

(i) Hr f(x) + e(x)], Gi) im Lf(x) · Ф()]. 


(2) Let lim f(x) z 0 and let (x) have no limit at хо. Prove that 


lim [f(x) - e(x)] does not exist. 





1 x?—4 
les. (1 i 
Examples. (1) Compute lim T3 


ча Consider the given function as a quotient of two functions f(x) = 
x? — 4 and ф(х) = x + 1 each of which has a limit at x = 0, namely, 
lim f(x) = lim (х2 - 4) = -4 and lim g(x) = lim (x + 1) = 1 #0. 
x0 x70 x70 x70 

Since a limit of (x) is not equal to zero, Statement (c) of Theorem 7.20 
is applicable here. Then we have 


42 0802-9 








1 m = —4, pe 
A rq dump) 
x20 Б 
2 
i x -1 
(2) Compute lim ear 


4 Setting f(x) = x? - 1 and g(x) = x - 1, we have lim f(x) = 0 and 
E x21 
lim g(x) = 0. Thus we can not use Statement (c) of Theorem 7.20 directly. 
х-1 
To resolve this difficulty we can represent the quotient of f(x) and (x) as 


x'-1  (x-DG-« 1) 
x-1 7 x-1 | 
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Observing that the definition of a limit at a point does not involve this 

point and the value of the function at this point, we can exclude the point 

= | from consideration. This means that we can reduce the fraction by 
X — 1 (assuming that x — 1 # 0). Then we have 








2 _ 
2 D x qoe: 
х-1 
Whence 
— 1 
lim = lim (xx + 1) = 2. » 
xl х-1 нээ ) 


N14x -1 
———2—— 


ча The limits of the nominator and denominator at х = 0 are equal to 
zero, and we can not use Theorem 7.20 directly. But multiplying the nomi- 


nator and denominator by V1 + x? + 1 and assuming x = 0, we have at 


xz 
(Vi-x -DNI-x +1) _ х? 
xX(N1- x? + 1) LNI 4+ x? + 1) 


1 


Мі +2 +1. 


Then applying the quotient formula given by Statement (с) of Theorem 
7.20, we obtain 
| NI-x' -1 : 
lim 3 = lim 


1 
x0 x 2-0 у] +x? +1 


(3) Compute lim 
x0 








1 


1 E 
im(V1-x +D) 2. 
x70 


(4) Consider the function f(x) = sin x? which is an even function, de- 
fined at every point of the number line and bounded everywhere so that 
[sin x?| « 1, vx. This function vanishes at x = x nz , where n = 0, 
e 
ча Consider two consecutive points at which f (x) vanishes. Let these 
points be nz and V(n + 1)т and the distance between them be 
Vin + Ds - Упт. 

Now we compute lim (V (n + 1)т — Упт ). Multiplying and dividing 

nov 


(V(n + рт — пт) by (V (n+ Da. + лт ), we obtain 
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lim (V(n + Dr — Упт) 


no 





(Qn + 1)т – Wns)(N(n + 1)т + Улт) 
n^ V (n + 1)т + Улт 








T 


lim = 0, 
no V(n + т + na 








since the nominator of the above fraction is a constant quantity and the 
denominator increases infinitely as и > со. Thus the distance between two 
consecutive points at which f(x) vanishes tends to zero as n > œ. Hence, 
the function f(x) = sin х? is nonperiodic. » 


One-sided limits. Let f(x) be defined in an interval (a, хо). Then a num- 
ber A is said to be a /eft-hand limit of f(x) at x if, given any є > 0, there 
exists 6 > 0 such that 


f(x) - A| < е, 


whenever Xo — ô < x < ж. 
In symbols, we write 
А = lim f(x) or A = f(x — 0). 
хо Хо — 
Now let f(x) be defined in an interval (xo, b). Then a number А is said 
to be a right-hand limit of f(x) at xo if, given any є > 0 there exists 6 > 0 
such that 


f(x) — Al < 


whenever xo < x < xo + 6. 
In symbols, we write 
A- lim A? or A = f(xo + 0). 
X-7*Xo— 

Suppose that a function f(x) is defined in the neighbourhood (хо — hı, 
Xo + h2), hy > 0,h2 > 0, of xo, except probably at xo. Furthermore, let there 
exist left- and right-hand limits of f(x) at xo (Fig. 7.21). 

Then applying the Cauchy criterion for limit of f(x) at xo, we can easily 
prove that for the function f(x) to have a limit at xo it is necessary and 
sufficient that there exist left- and right-hand limits of f(x) at xo and these 
limits be equal to each other. In other words, it is easy to prove that 


Лоо — 0) = Ло + 0) = lim До). 
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< Let lim f(x) = A. Then given any ғ > 0 there exists 6 > 0 such that 


|Дх) - A| < (*) 


holds for all x (x = хо) іп (xo — 6, xo + ô). 
Evidently, (+) holds for all x in (xo — ô, xo) and all x in (xo, xo + ô) 
so that by definition we have 


А = lim mid and A= lim f(x). 
0 


хә X» — X Xo * 
Conversely, let A = lim f(x) and A= lim f(x). Then given any 
xxo—0 xx, +0 
€ > 0 there exist 6; > 0 and & > 0 such that 
If) - Al «e 


whenever xo — б| < x < xo and xo < x < xo + à. 
yi кш 


f(xg* 0) 

















О Xo 


Fig. 7.21 


мү 





Set 6 = тіп {6,, 8). Then we obtain 
|f) - Al « e 
for all x such that 0 < |x — xo| < 5. Whence it follows that 
lim f(x) = A. > 


X 7 Xo 


f 0, 
Examples. (1) Consider the function f(x) = Е shown 
1 for x=0, 
in Fig. 7.22. 
We have 


lim Дх) = lim f(x) = 02 lim f(x) = 0. 
x70-0 x70+0 x70 
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(2) Consider the function f(x) = i , X z 0, shown in Fig. 7.23. 


РЕ el/* 
We have 
lim f(x) = 1 and lim Дх) = 0 = alim f(x). 
х-0-0 х-0-0 . x70 


(3) Consider the function f(x) = e!/*, x z 0, shown in Fig. 7.24. 
We have 
lim f(x) = 0 апа lim f(x) = +æ = alim f(x). 

x70+0 x70 


х-0- 


In conclusion we suppose that f(x) is defined on a closed interval 
[a, b] or on an open interval (а, b). Then at a f(x) can have only a right- 
hand limit and at b it can only have a left-hand limit. 


| 




















HÌ 
о 
нї 





Еір. 7.23 Fig. 7.24 


7.6 Continuous Functions. Continuity at a Point 


Notion of continuity at a point. Let a function f(x) be defined in 
a neighbourhood Q of a point хо. A function f(x) is continuous at xo if 
0) f(x) has a limit at xo and (ii) the limit of f(x) at хо is equal to the value 
хо) of f(x) at хо, ie, : 


lim f(x) = fo). (9 


Since хо = lim x we can write (ж) as 


XX 


lim f(x) = (tim x). 


X 7* Xo х Хо 


In other words, for a continuous function f(x) the symbols lim and f can 
be interchanged in (9). 
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Now we give a precise “e-5” definition of continuity. 

Let f(x) be defined at хо and in a neighbourhood Q of x. Then f(x) 
is a continuous function at xo if, given any € > 0, there exists 6 > 0 such 
that 


S — ЛХ) < € 


whenever |x — xo| < 6 and x € Q. 
Using logical symbols, we can express this definition as 
(f(x) is continuous at xo) ө УЕ > 0 36 > 0, vx cQ, 


Ix — x| < 8 = |) – Л) < e. 


Notice that in general the value of 6 depends on both the number є > 0 
and thé point xo, i.e., 6 = d(€, xo). It is important to mention here that 
the "2-6" definition of continuity does not require x x xo. 

We can also give a slightly different definition of continuity at a point. 
Let y = f(x) be defined at xo and in the neighbourhood Q of хо (Fig. 7.25). 
Consider the point x = xo + Ax in Q which differs from xo by a positive 
or negative quantity Ax. The quantity Ax is called the increment of the 
argument x at Xo. Then the difference 


Ay = f(x + Ax) — f(x) 


is called the increment of the function f at xo, corresponding to the incre- 
ment Ax of the independent variable x. 


XQ аг T 
Fig. 7.25 


In terms of the increment of x the condition of continuity of f(x) at 
Xo, ie, 


lim f(x) — f(xo) 


becomes 
lim До + Ax) = f(xo) 


Ах-0 
or, equivalently, 
іт [о + Ax) — fo) = 0. (9) 


Ах-0 
Observe that f(x + Ax) — f(xo) = Ay. Then (**) сап be written as 
lim Ay - 0, 


Ах-0 


1 


18—9505 
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which offers another definition of the continuity of y = f(x) at the point 

Xo: the function f(x) is continuous at x € Q if the increment of f at xo, 

corresponding to the increment Ax of x, tends to zero as Ax > 0. 
Example. Let us show that the function y — x? is continuous at every 

point of the number line. 

«8 Indeed, given any increment Ax at Хо we have 


Ay = (х + Ax)? — x, = 2хоАх + (Ах)? = (2% + AX)Ax. 


Whence it follows that Ay > 0 as Ax > 0. This implies that y = x? is con- 
tinuous at every point хо of the number line. 9» 

The definitions given above are equivalent to each other and any of 
them can be applied when suitable. 

In many cases it is convenient to apply the definition of continuity at 
a point which stems from the sequential criterion for limits. Let f(x) be 
defined on any set E of real numbers and let xo € E. A function f(x) is 
continuous at xo if for any sequence {Xn}, x4 € E which converges to xo 
the corresponding sequence { f(x;)] of the values of f(x) converges to (xo). 

Example. Prove that the Dirichlet function 


ро) = 1 for raona] х, 
0 for irrational х 


is discontinuous at every point. 

48 let хо be any irrational number. Then D(x») = 0 and whatever хо there 
exists a sequence {rn} of rational numbers, converging to xo. By definition 
D(rn) = 1 Vn; so the sequence {D(r,)} = (1) converges to 1 so that 
(Г(га)| ? D(xo). This means that D(x) is not continuous for irrational 
x. By similar reasoning we can easily verify that D(x) is discontinuous for 
all rational x. 9» 

It is not hard to show that the “г-6” definition of continuity at a point 
and the definition stemming from the sequential criterion for limits are 
equivalent. 

Properties of functions continuous at a point. The following two the- 
orems describe the important properties of continuous functions. 

Theorem 7.21. If a function f(x) is continuous at a point хо and if 
Љо) > А (f(xo) < A) then there exists ó > 0 such that f(x) > A (f(x) < A) 
for ail x in the open interval (xo — ô, xo + ô). 
<4 For definiteness, we assume that f(xo) > A. Then 


SQ) = А + А, 
where й > 0. 


Let € = 2 . Since f(x) is continuous at Хо there exists 6 > 0 such that 
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for all x 
б) - fol < Ë 
or 


h h 
75 « f(x) ~ fe) < 5 


whenever |x — xo| < 6. Whence 
h h h 
Јо) > Лю) - =A *h-y =А+у. 

Thus, f(x) > A, Vx€(xo — ô, Xo + ô). > 

From the preceding theorem the following theorem can be derived. 

Theorem 7.22. Let f(x) be continuous at хо and let f(xo) # 0. Then there 
exists a neighbourhood (xo — 6, xo + 5) of xo such that f(x) does not vanish 
and is of the same sign at every point of the neighbourhood. 

«8 To prove this theorem it suffices to put A = 0 in Theorem 7.21. » 
Continuity of elementary functions. Basic elementary functions are? 
(a) power function y = x“, where а is an arbitrary real number, x > 0; 
(b) exponential function y = à, a>0,a#1, -0 < х < + о; 

(с) logarithmic function y = loge x, a > 0, az 1, x 5 0, 

(d) trigonometric functions y = sinx, – о < x < +œ, y = cosx; 
—ocx« +œ, у = tanx, x #5 + пт, п = 0, +1, +2,...,y = сох; 
хэ пт, п = 0, +1, +2, ...; 

(e) inverse trigonometric functions у -sin !x, -l<x<l, у=. 
cos !x, -1<х<1, y-tan'!x, -о«х« +m, y-cot х, 
—oc«x« +0, 

We say that elementary functions are those obtained by means of a finite 
number of arithmetic operations and a finite number of operations of con- 
structing a function of a function. It is not hard to show that the basic 
elementary functions are continuous at every point of their domains. 

We prove that the function y — cos x is continuous at every point of 
the domain – о < x < +œ. 
ча First we prove the auxiliary inequality 


|sin x| < |x| vx. (*) 


Look at a circle of unit radius (Fig. 7.26). Let a radian measure of the 
т 
2 
AOC. Evidently, the length of the segment BC is equal to 2 sin x and the 


angle AOB be x, 0 < x < = and let the angle АОВ be equal to the angle 


Э A detailed account of basic elementary functions is given in the Appendix. 
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length of the arc BÓ is equal to 2x. Since the arc length is greater than 
the length of the subtending chord we have 


2 sinx « 2x and sinx « x. 
This inequality can be written as |sin x| « |x| for x € (0, 1/2). The identities 
[sin (—x)| = | -sinx| = |sinx| and | — x| = |x| imply that the inequality 
[sin x| < |x| also holds for x € (— 7/2, 0). Observe that (*) is also true for 
all xe(— 2/2, т/2) since sinO = 0. If х 7/2, 7/2) then |x| > т/2 > 1, 
whereas |sin x| & 1 vx. Thus we conclude that the inequality 


[sin x| < |х| 


holds for any x. 

Now we turn to the function у = cos x which is defined at each point 
of the number line. Let x be an arbitrary point (x € IR) and Ax be an incre- 
ment of x at x. Then the increment of y = cosx can be expressed as 


: AXN . Ах 
Ay = cos (x + Ax) — cosx = —2 sin (х + F) sin ——. 











2 2 
Whence 
Ax Ax 
22222 Ах\. Ах 
|Ay| sin (х +5 ) sin — 
= 2 | sin (х + X) | sin 25 А (**) 
с 
8 
B N 
9 AN A 
С 
Fig. 7.26 Fig. 7.27 
: Ах : 
Recall that | sin (5 + =) < 1 vx, vAx and, by virtue of (*), 
| sin 25 < Де Thus we can write (**) as 
Ax 
lay] < 2:1. 2. = jax 


22 


and 
0 < |Ay| < |Ax|, 


where Ay is a function of Ax for a given x. 
Whence, by virtue of Theorem 7.13, we obtain 
lim Ay = 0 
Ах-0 
which means that the function y — cos x is continuous at every point of 
the number line. » 


"ME ? 1 sin x m : 
Remarkable limits. We consider the relation E which is a function 


of x defined at any x x 0 and prove the following theorem. 
Theorem 7.23. If x is measured in radians then 
. sinx 
lim =] 
х 


x20 





-« Suppose that O < x < «/2 and consider a circle of unit radius 
(Fig. 7.27). From Fig. 7.27 it is easily seen that 


area of AOAB < area of sector OAB « area of AOAC. 


These areas are equal to 1 sin x, 1 x and І tan x, respectively. Then we have 


2 2 2 
sinx «x < tanx, x€(0, 1/2). 


Dividing all the terms of these inequalities by sin x > 0, we obtain 


Whence 
sin x 


1» ЗГ > cosx (x) 


which holds for x € (0, 7/2) and also for ХЄ(-тл/2, 0) since 


sin(-x) _ sinx 
-x 





and cos(— x) = cos x. 


The function y = cosx is continuous at any x and, in particular, at 
X = 0 so that 


lim cos x = со$ 0 = 1. (жжж») 
х-0 


Thus both y(x) = cos x and y(x) = 1 have the limits equal to 1 at x = 0. 
By Theorem 7.13 (***) and (иж) yield the identity we are seeking for. В» 
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In the preceding sections we have proved that 


1 n 
lim ( +1) = 6. 
E n 


Denoting 1/n = z we have n = 1/z and z > 0 as n > œ. Then 


lim (1 + z)'* = e, 
1-0 


where z = 1, 1/2, 1/3, .... 


We can verify that there exists lim (1 + х)!“ 
x70 


which is also equal to the 


number e when x tends to 0 in an arbitrary way by running through any 
sequence of real values distinct from 0, i.e., 


lim (1 + х) = e. 
x0 


Operations on functions continuous at a point. In this section we shall 
develop a number of important properties of continuous functions. 
Theorem 7.24. Let f(x) and ф(х) be defined at хо and in а neighbour- 
hood of x. If f(x) and p(x) are continuous at Хо so are the sum f(x) + ф(х), 
the difference f(x) — p(x), the product f(x) e(x) and, if e(x) x 0, the quo- 
fx 
p(x) ` 


By way of illustration we shall prove the quotient rule; all the other 
rules can be proved in a similar way. 
* Let f(x) and (x) be continuous at xo and let ф(х) z 0. By Theo- 
rem 7.22 there exists a neighbourhood of xo such that у(х) # 0 for all xin this 


tient 





neighbourhood. Thus, the function F(x) = 70) is defined at хо and in 





a neighbourhood of хо. Since lim f(x) = Дх) and lim ф(х) = ф(ж) # 0 


the quotient rule (see Theorem 7.20) is applicable here and we have 








lim f(x) 
: ooa SO) | хэв £09) _ 
ie OS QOO lime) eU) “°°? 
Thus lim F(x) = Ғ(хо), ie, the function F(x) = ге is continuous 


at x. » 

Composite functions. Let E be an arbitrary subset of a number line 
and let и = ф(х) be a function defined on Е. We denote by Е; a set of 
values of u corresponding to the values of x in E. Furthermore, let there 
be defined a function y = f(u) on Ej. Then every x € E corresponds to a 
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certain и € Е which in turn is associated with a certain value of y = f(u) 
(Fig. 7.28). Thus, the value of y is a function of x which is defined on 
E. In this case we say that y is a composite function of x and write 


y = fleo]. 


For instance, if и = sin x and y = e" then y = e^"* is a composite function 
defined at all x. 


y=F(u) =F (х) 





Fig. 7.28 


Theorem 7.25. Let и = ф(х) have a limit equal to A at хо and let 
y = f(u) be a continuous function at the point и = A. Then a composite 
function y = f [e(x)] has a limit equal to f(A) at хо. 
“4 We have to prove that lim f[e(x)] = f(A), ie, 
X 7* Xo 
ve > 0 36 > 0, vx, x Z Xo, x - X| < = |Ле()] — Д(А)| < e. 


Let е > 0 be an arbitrary number. Since f(u) is continuous at и = A 
then for the given є > 0 there exists a number у > 0 such that 


Iu) – ХА) < е (+) 


for all u whenever |u — A| < 7. 
By definition A = lim ф(х) so that for any у > 0 there exists 6 > 0 


Х-9 Ху 
such that |e(x) — A| < у or, equivalently, |; — A| < у for all x z xo 
whenever |x — xo| « 8. 
The inequality |u — A| < у implies (ж) which can be written as 


ИМебдї-ЛА) < е. 
Thus, for апу € > 0 there exists 6 > 0 such that 
Иебд-ЛА) < е 
whenever |x — xo| < 6 and x z хо. 
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This means that the number f(A) is a limit of the composite tunction 


Slex) at ж. » 
We see that when the conditions of Theorem 7.25 are fulfilled we have 


lm Sie) = f(A) 
or, equivalently, 


nm fie] = Літ e(x)]. 


The latter identity indicates the rule of computing a limit of a composite 
function. 


Example. Show that Tm in EE y = 1. 
ч Observe that in шэн In(1 + x)" The function 


y = In (1 + х)!“ is a composite function made up of the functions y = Inu 
and и = (1 + x)!”*. Since lim (1 + x)!“ = e and y = Inu is continuous at 
x0 


u — e. Theorem 7.25 gives 


Е 1 7 : 
jin B. limIn(1 + x) = In[lim(1 + х) = Ine-1. > 
xo x xo x0 
Theorem 7.26. Let и = ф(х) be a continuous function at хо and let 
y = f(u) be continuous at uo = e(xo). Then a composite function 
y = fle(x)] is continuous at xo. 
ча Since и = ф(х) is continuous at xo it has at x a limit equal to 
v (Xo) = uo. Besides, y = f(u) is continuous at ио. Then Theorem 7.25 im- 
plies that the composite function y = f[y(x)] has at xo a limit equal to 
(uo) = Ле) or 
lim f[e(x)) = flpo). 
X 7* Xo 
Whence we infer that a composite function f/[o(x)] is continuous at хо. 9 


Discontinuities of functions. Let f(x) be defined at a point xo and in 
a neighbourhood of хо. If f(x) is continuous at x then 


lim flx) = Лоо) 
or, in terms of one-sided limits, 


lim fe um. (70) = fo). (*) 


X — Xo — 
Thus the pee f(x) is continuous at the point xo if and only if there 
exist the left-hand and right-hand limits of f(x) at xo these limits being 
equal to each other and to the value of f(x) at хо. 
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Definition. The function /(x) has a discontinuity at the point xo if f(x) 
is not continuous at хо and хо is called the point of discontinuity”. 

The discontinuities of a function can be classified according to the sense 
of violating the condition (*). 

Definition. A point хо is called a removable discontinuity of f(x) if f(x) 
has the left- and right-hand limits at xo which are equal to each other but 
are different from the value of f(x) at xo, i.e., 

lim Дх) = lim f(x) # f(x). 
хэхо- 0 хох +0 

When хо is a removable discontinuity it suffices to modify the function 
f(x) only at the point xo so that f(x) becomes continuous at хо. That is, 
if xo is a removable discontinuity of f(x) then the function 
F(x) = Ах) for x x Хо, 

lim f(x) for x = xo 

X 7* Xo 
is continuous at xo. We have removed a discontinuity by changing the value 
of f(x) at one point, хо. 











У 
* f(x) 
1 
0 X 0 
Fig. 7.29 Fig. 7.30 


Example. Let 


_ (|x| for x = 0, 
195 F for x = 0. 
44 We have 
іт Дх = lim f()02027z1 = f0), 
x—^x,—0 xxj*0 


so that the point x = 0 is a removable discontinuity of f(x) (Fig. 7.29). 
If we modify f(x) at x = 0 by putting /(0) = 0 then we shall get a function 
F(x) = |x| which is continuous at x = 0. » 


? If f(x) is not defined at хо the point xo is also called the point of discontinuity. 
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In general, if a function f(x) is continuous on an open interval (xo — ài, 
Xo + 62) everywhere, except at xo, and if хо is a removable discontinuity 
then the graph of f(x) is represented by a continuous curve from which 
the point corresponding to the abscissa xo is deleted and is replaced by 
the point of f(xo) whose ordinate is different from the ordinate of the re- 
moved point (Fig. 7.30). It is important to mention that if xo is a removable 
discontinuity then at хо there exists lim f(x). 


X — Xo 


We say that xo is an unremovable discontinuity of f(x) if lim f(x) does 
X -* Xo 


not exist at xo. 

Definition. If the left- and right-hand limits of f(x) at xo are finite and 
are different, ie, if 

lim f(x) # lim Дх), 
хохо 0 хох +0 

then the point хо is a discontinuity at which f(x) has a Jump (ог a gap) 
and f(xo) does not need to be equal to either of the one-sided limits. In 
this case the jump of Дх) at xo is equal to the difference 
Ло + 0) — f(xo — 0) of the left- and right-hand limits of f(x) at хо. 











Ht 





Fig. 7.31 


Example. Consider f(x) = Lee ДО) = 1 (Fig. 7.31). 


ч At x = 0 the function f(x) has a jump since 
lim Дх)=2 and lim Дх)=0. > 
x70-0 x2040 
A. removable discontinuity and a discontinuity at which the function 
has a jump are called the discontinuities of the first kind. If xo is a discon- 


tinuity of the first kind the function f(x) has finite left-hand and right-hand 
limits at xo. All other discontinuities are called discontinuities of the second 
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kind. Thus, if x is a discontinuity of the second kind the function f(x) 
at xo does not have either a finite left-hand limit or a finite right-hand limit. 


Examples. (1) If f(x) = 1 , x z 0, ДО) = 0, the point x = 0 is a discon- 
tinuity of the second kind, for lim f(x) = -œ and lim f(x) = +, 
x20-0 х-0-0 


(2) Let f(x) = sin} x 0, f(0) = 0. 


ча At x = 0 f(x) has neither finite nor infinite one-sided limits. We can 
easily prove this by applying the sequential criterion for limits. Thus the 
point x = 0 is a discontinuity of the second kind. » 
(3) For the Dirichlet function 
1 for rational x, 
DOS ү 
0 for irrational х 
any point xo is a discontinuity of the second kind. 
We say that a function f(x) is continuous at a point xo from the right if 


lim 6/00 = f(x) ог Дх + 0) = f(xo), 


x Xo 
and f(x) is continuous at xo from the left if 


lim A9 = f(x) ог Дж – 0) f(xo). 


хә Xo — 


7.7 Continuity on a Closed Interval 


Definitions. A function f(x) is said to be continuous on an open 
interval (a, b) if f(x) is continuous at every point of (a, b). We denote 
a set of all functions continuous on an open interval (a, Р) by C(a, Б). 

A. function f(x) is said to be continuous on a closed interval [a, b] if 
f(x) is continuous on an open interval (a, b) and is continuous from the 
left at b and from the right at a. We denote a set of all functions continuous 
оп [a, Б] by Cla, b]. 

Properties of functions continuous on closed intervals can be estab- 
lished with the help of the following theorems. 

Theorem 7.27. Let f(x) be continuous on a closed interval (а, b] and 
let f(x) and f(b) be of opposite signs. Then there exists at least one point 
in (a, b) at which f(x) vanishes. 
atb 

2 
point of [a, b]. ІЁ) = 0 we get what we are seeking for, i.e., the theorem 
is true. Let f(£) ;# 0. Consider the closed intervals [a, £] and [£, b]. One 


-« Let f(a) and f(b) be of opposite signs. The point £ = is a mid- 
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of these intervals denoted by (аг, bı] is such that f(x) assumes the values: 
of opposite signs at its endpoints. Clearly, the point & = du MR 5 b becomes 
the midpoint of (аг, bi]. If f(£) = 0 the theorem is proved. However it 
may occur that Д) # 0. Then we choose between the closed intervals 
[a:, &\] and [, bı] the one at whose endpoints f(x) assumes the values 
of opposite signs, denote this closed interval by |, bz], and repeat the 
process as above. 

It is clear that if we continue this process we finally either arrive at 
a point a € (a, Б) where (о) = 0, thus completing the proof of the theorem, 
or we obtain an infinite sequence of the nested closed intervals 


la, b] D (а, JD... D [a Dx] .... 


All the closed intervals tend to zero as n > со, ie, 
b-a _ 
2 





lim (bn — an) = lim 


and on every closed interval f(x) assumes the values of opposite signs at 
the endpoints. 

Recall that the Cantor lemma implies that there exists a unique point 
a contained in each йл, Б], n = 0, 1, 2, ... . Let us prove that f(a) = 0. 

We assume the converse, i.e., f(a) z 0, and for definiteness we set 
f(a) > 0. Since f(x) is continuous at a € [a, b]. Theorem 7.22 implies that 
there exists an open interval (a — ô, а + ô) in which f(x) is positive. As 
a = lim a, = lim 5, we can choose sufficiently large n so that the closed 


п со п о 
interval [an, bn] is contained in the open interval (а — 6, а + ô). This means 
that f(a,) and f(bn) are of the same sign (both positive). However on every 
closed interval (ал, bn] f(an) and f(b,) are of opposite signs. Thus we have 
a contradiction which makes our assumption of f(a) x 0 false. Hence, 
f(a) = 0. Notice that a < о < b, a є [a, b] and the point a must be distinct 
from both a and b since f(a) 50 and f(b) = 0. » 

It is easy to interpret this theorem geometrically (Fig. 7.32). . If 
Да) f(b) < 0 then the points A(a, f(a)) and B(b, f(b)) occur on different 
semiplanes relative to the x-axis, and the graph of the continuous function 
f(x) intersects the x-axis at least at one point between a and b. 

It is important that f(x) must be continuous of [a, b]. If the function 
is discontinuous at a point in [a, b] it may have a jump from a negative 
value to a positive one without assuming a zero value as does the function 


-1 fo -l<x<0O, 
fo = { 1 fo 0<х<1, 


shown in Fig. 7.33.. 
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To illustrate how Theorem 7.27 works we consider the polynomial equa- 
tion of odd degree with real coefficients 


Pon 1(X) = ax? *! + ax? +... + ау+ = 0. 


ч Let a > 0. For sufficiently large absolute values of x P», + 1(х) becomes 
negative when x is negative and positive when x is positive. Since the poly- 
nomial is everywhere a continuous function of x, P», + ;(x) necessarily van- 
ishes at some intermediate point of its domain when changing its sign from 
negative to positive. Hence any polynomial of an odd degree with real 
coefficients has at least one real root. » 


yi y 


f(b) 








о 
аа 
E 


mi 








Fig. 7.32 Fig. 7.33 


We can apply Theorem 7.27 when we wish to find out whether a poly- 
nomial has a real root and, if any, to compute an approximate value of 
the root. 

Example. Consider the polynomial Р(х) = x? + x — 1. 
ча Since Рз(х) is of odd degree it has at least one real root. 

Evidently, we have P3(0) = —1 < 0 and Р;(1) = 1 > 0, i.e., at the end- 
points of (0, 1] Р(х) assumes the values of opposite signs. Hence P3(x) 
has a real root contained in (0, 1). 

At the midpoint £; = 1/2 of the closed interval (0, 1] we have 
P4(1/2) = —3/8 < 0. Recall that P3(1) > 0. This means that the desired 
root is contained in the open interval (1/2, 1). Again we compute the value 
of P3(x) at the midpoint of [1/2, 1], ie, at & = 3/4. We obtain P3(1/2) «0 
and P;(3/4) = 11/64 > 0. Hence the root of Р(х) is contained 
in (1/2, 3/4). Working through this process we obtain a sequence of the 
nested open intervals of decreasing lengths. In other words, at each step 
of this process we improve the approximation of the root whose precision 
is given by the length of the last open interval thus obtained. » 
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Theorem 7.28 (intermediate value theorem). Let f(x) be continuous on 
[a, b] and let f(a) = A and f(b) = B. If С is any value between A and 
B then there exists at least one point а € [a, b] such that f(a) = C, ie, 
if f(x) is continuous on [a, b] it assumes all intermediate values between 
Да) and f(b). 
ч Consider the function ф(х) = f(x) — C. 

For definiteness we assume that A < B and A < C < B. Clearly, ф(х) 
is continuous on [a, 5] and 


e(a) = Ҳа) - C-A- C«0, 
e(b) = f(b) - C- В – С> 0. 
By virtue of Theorem 7.27 there exists a € (a, b) such that g(a) = 
Да) - C = 0. Hence f(a) = C. » 
Theorem 7.29. If a function f(x) is continuous on |a, b] then f(x) is 


bounded on [a, b], ie, there exists a number K » 0 such that for all 
x€[a, b] 


VOI < К. 


Remark. If a function f(x) is continuous on an open interval (a, b) 
or on half-open intervals (a, Р| or [a, b) then f(x) is not necessarily bound- 
ed on the respective interval. For example, f(x) = 1/x is continuous on the 
half-open interval (0, 1] and is not bounded on it. 

Now we suppose that f(x) is defined and bounded on a set E. We say 
that a supremum of the set of values of f(x) ón E is a supremum M of 
the function f(x) on E, ie, 


M = sup f(x). 
xeE 


Similarly, an infimum m of f(x) on E is 
m= inf f(x). 
xeE 
Theorem 7.30. If a function f(x) is continuous on [a, b] then f(x) attains 


its supremum and its infimum on [a, b], ie, there exist numbers Ё and 
n in [a, b] such that 


ДЕ) = т = int Ло), Хт) = М = sup Ло), 


and 


JE) < f(x) < Ла). 


It is important here that f(x) is continuous on [2, b]. For instance, 
f(x) = x is continuous and bounded on (— 1, 1) but does not attain its 
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supremum sup xX = 1, ie, there exists no xo€(-—1, 1) such that 
хє( – 1, 


f(xo) = 1. Analogously, f(x) does not attain its infimum inf х= -l. 
xe(- 1,1) 


Example. Consider f(x) = x — [x] on (0, 1] (Fig. 7.34). 
ча Here sup f(x) = 1 is unattainable on (0, 1], since f(x) is not continu- 
x€ [0,1 


ous on (0, 1]. » 





+ : - 
Fig. 7.34 


We call the supremum of f(x) the absolute maximum of f(x) on 
[a, Б] and the infimum of f(x) the absolute minimum of f(x) on [a, b]. 
Then we can express Theorem 7.30 in a different way. 

Theorem 7.30'. If a function f(x) is continuous on [a, b] then f(x) at- 
tains its absolute maximum and absolute minimum on [a, b]. 

Uniform continuity. Functions continuous on closed intervals possess 
an important property of uniform continuity. 

Let f(x) be continuous on (а, b). Then given any xo € (a, b) and any > 
number € > 0 there exists a number 6 > 0 such that |f(x) — f(x) < € 
whenever хє (a, b) and |x — xo| < ô. 

The value of ё may depend both on € and on xo, i.e, 6 = (€, xo) so 
that it may happen that for a given є > 0 6 can be different for different 
хк € (a, b) and there exists no 6 which will work for all xx € (a, b). The 
requirement that there exist б = (£) > 0 for all xx is stronger than the re- 
quirement that f(x) be continuous on (a, b). 

Definition. A function f(x) is said to be uniformly continuous on an 
open interval (a, b) if for every ғ > 0 there exists ô = 6(£) > 0 such that 


fx’) - Лх") < е 
whenever |x’ — x”| < ô and x’, x" є (а, b). 
Using logical symbols, we can write 


(f(x) is uniformly continuous on (a, b)) e ve > 0, 
36 = 6(€) > 0, wx’, x" Є (а, Б), 
Ix’ -x"| «6» |x) - fx") < e. 
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It is important here that for every = > 0 there exists 6 > 0 such that 
f(x’) — fx Э < € for all x’, x” belonging to (a, Б) only under the condi- 
tion |x’ — x"| < ô. For example: the function f(x) = x is uniformly con- 
tinuous on the number line. To verify this it suffices to put 6 = є. 

It is easy to see that if f(x) is a uniformly continuous function on 
(a, b), it is continuous at every point x € (a, b). The converse is not true. 


For instance, f(x) — sin х is continuous on (0, 1) and is not uniformly 





continuous on (0, 1). Indeed, let x; — L and x; = ALT Then we can 
choose n so that the value of 
НЬ 1 _ 2 Е 1 
й n 2п +1 п(2п + 1) 








becomes smaller than апу 6 > 0 whereas 
Џо) — fo) = 1> е ve<l. 
Thus for € > 0, say for € = 2 and for any ó > 0 there exist points 
Xn and ху in (0, 1) such that [xz — x» | < 6 and |Д) — ДХ) > e. Hence 


f(x) = sin E is not uniformly continuous on (0, 1). 


Example. The function f(x) = 1 is continuous оп (0, 1) but is not uni- 


formly continuous on (0, 1). 


ча Lex, = 1 and x; = T cu as where ғ > 0 is an arbitrary number. 
Then x; – x4 | = EOR which can be made smaller than any ô > 0 
n(n + 2e) 


by choosing eae large n. However |f(x4) — f$ )| = 2e > e ve > 0. 
Hence f(x) = — is not uniformly continuous on (0, 1). 


The кее theorem is worth mentioning. 
Theorem 7.31 (due to Cantor). Zf a function f(x) is continuous on a 
closed interval [a, b] then f(x) is uniformly continuous on (а, b]. 


7.8 Comparison of Infinitesimals 


Definitions. Let a(x) and 8(х) be infinitesimals as x > хо. Then 
(a) a(x) is an infinitesimal of higher order than B(x) if 





ime 


ron BX) Б 
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In symbols, we write a(x) = o(8(x)), x ^ хо, to mean that o(8(x)), 
x> Хо, is an infinitesimal of higher order at хо than an infinitesimal G(x) 
at xo. 

For example, a(x) = х? and B(x) = x are infinitesimals as x > 0. Then 


x0 B(x) x20 X x0 


Whence a(x) = o(8(x), x ^0, or х? = o(x), x 0. 
(b) a(x) and 8(x) are infinitesimals of the same order if 
. a(x) 
lim ——— = С #0. 
roe В(Х) 
For example, a(x) = 2x and 8(x) = x, x > 0 are infinitesimals of the 
same order since 
- a(x) Ae л2Х 
lim —— = Шт =2. 
вєг B(x) iso x 
(c) a(x) and B(x) are incomparable infinitesimals as x > xo if the ratio 
a(x) 
B(x) 
a(x) = x sin І апа В(х) = x are incomparable since the ratio ao = 
x 809) 


sini has no finite limit at x = 0 and is not an infinity as x > 0. 


has neither finite nor infinite limit as х > x. For example, 


(d) a(x) is an infinitesimal of the mth order, where m is a positive in- 
teger, relative to the infinitesimal B(x) = x — xo as x > xo if 


lim- 9 Reo. 
мэс, (x 2 хо)" 
For example, a(x) = 3 sin’ x is an infinitesimal of order т = 2 relative 
to B(x) = x as x > 0 since 
. 3 біп? х 
lim түре F 
x70 x 


= 3. 


Equivalent infinitesimals. The infinitesimals a(x) and B(x) as x > xo are 
said to be equivalent if at the point xo 


(in КО 


X — Xo B(x) i 


Thus equivalent infinitesimals are infinitesimals of the same order. 


l. 


19—9505 
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We write 
a(x) ~ B(x), x > xo 


to mean that a(x) and 8(x) are equivalent infinitesimals. 

Sometimes equivalent infinitesimals a(x) and 8(x) are called asymptoti- 
cally equal as x > хо. 

Remark. Let a(x), 8(x) and у(х) be infinitesimals as x > хо. It is not 
hard to verify that 

(1) a(x) ~ a(x) at x ^ xo; 

(2) if a(x) ~ B(x), then B(x) ~ a(x) at x ^ Хо: 

(3) if а(х) ~ B(x) and B(x) ~ ф(х), then a(x) ~ y(x) at x > xo. 

Thus the equivalent relation is a reflexive, symmetric and transitive re- 
lation. 

Examples of equivalent infinitesimals. In the previous sections we have 
proved that 


sin x 


lim ——— = 1 ә sinx~ х, x20 
x70 x 
and 
тула а l=In(l+x)~x, x0. 
x70 x 


It is easy to verify the following: 





. tanx 

lim = 1 ә ќапх ~ х, х 0, 

x70 

sn^!x 

lim ——— = | ә ѕіп-іх- x, x0, 
x0 x 

. tan !x n 

lim ———— 21»2tan !x- x, хә б. 
x0 x 


By way of illustration we show that 
— 1 





. a 
lim 
x0 


—-]na,(a»50,az1).- 


я Put а-1-у so that y>0 as x0, and а'=1+у, 





x= IU E Whence 
Ina 
ig ie am ae 
298. ОХ yoo ҺИ +y)  y-oln(l + у) : 
Ina y 


Thus a2 – 1 ~ x Ina, x> 0. 
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In particular, putting a — e, we get 


x 





lim 2125e€-1-x, x20. » 
x70 
P-] 
Let us also verify that lim er» e = p where p is a real number. 
x0 


ча Put (1+ x) - 1 = y so that y 0 as x20 and (1+ х)? = 1 + y. 
Whence 
pin(l + x) = In(1 + y). 


Thus we can write 





ачжчх/”-1 y | y . pln(l +x) 
A. ^x (1+ у) x Е 
Working through the limit as x ^ 0 (y > 0), we obtain 
2 1-Хх97-1 . y . pnl +x) 
li rA . E UIROS M о Ira m 
а x а In (1 + y) 10 x Р 


Hence, (1 + х)? – 1 ~ рух, x> 0. > 
The following list of equivalent infinitesimals is extremely helpful to 
work through a variety of topics discussed in calculus: 


пх ~ x, x0; 
tanx ~ x, x 26; 

sn !x-x, x 70; 
tan !x ^ x, х 0; 

In(] +x) ~x, x70; 
а — 1l ~ xlna, x> 0; 
е- 1-х, хә 0; 

(A +x- 1 ~ рх, x 0. 


Remark. If there exist а nonzero number а and a positive integer т 
such that f(x) ~ ax", x > 0 we call ax" the principal (power) asymptotic 
term of the function f(x) as x ^ 0. The right-hand terms of asymptotic 
equalities in the list presented above are the principal (power) terms of their 
left-hand terms. 

Theorem 7.32. Let a(x), В(Х), ол(х) and 81(х) be infinitesimals as x ^ Хо 
and let a(x) ~ ол(х) and B(x) ~ B(x). If there exists a finite or infinite 
limit of eo 

8(x) 
by a(x) and B(x) is replaced by 81(х). 





at xo this limit remains unchanged when a(x) is replaced 


19* 
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: a (x) . 
ча Consider Bree) and express it as 
a(x) | a(x) | a(x) . B(x) (9) 





8x) ох) BWO Bix)” 
By the. hypothesis of the theorem 





от (x) B(x) 
m — — —] and lim = |] 
E a(x) e ftx) 
Let A be a limit of P at хо. Then Theorem 7.20(b) gives 


ex) (2000) „ a) ,. 80) _ 
imo) Оу Bo) PU BI 


























Whence we infer that the ratio m also has a limit equal to A at xo. 
1 
If the ratio rx is an infinity as x > xo then the right-hand side of 
a(x) -—— 
(*) and, consequently, f) are also infinities as x > xo. 9» 
1 
Example. Compute па i m 
o (1 + x^) 
ч Applying Theorem 7.32 and the list of equivalent infinitesimals, we 
have e 
2 sin? X 222 
1-cosx  , 2 н AN NETS 
хэд In(l x?) x-0 x! xo х? 2' 


Theorem 7.33. For the infinitesimals a(x) and B(x) as x ^ xo to be 
equivalent it is necessary and sufficient that the difference of a(x) and 8(x) 
as x> хо be an infinitesimal of higher order than a(x) and B(x). 
ча Necessity. Let a(x) and B(x) be equivalent infinitesimals as x > хо. We 
prove that the difference у(х) = a(x) — B(x) as x > хо is the infinitesimal 
of higher order than 8(x) and, consequently, a(x). 

Indeed, by the hypothesis of the theorem we have a(x) ~ B(x), x > xo 
so that lim аб) = 1. Whence 


X Xo B(x) 


y(x) .. «a(x)-B(x) _ ,. a(x) | = 
mu S aay e) eR 


This means that y(x) is the infinitesimal of higher order than B(x) as 
X — Xo. ` 
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Sufficiency. Let y(x) = a(x) — B(x) be an infinitesimal of higher order 
than the infinitesimals a(x) or B(x) as x ^ хо. We prove that a(x) ~ B(x) 
as X ж. 

Indeed, as stated у(х) is an infinitesimal of higher order than B(x); so 





2 YO _ 
um 8C) 7 
Whence 








. a(x) .. Ве) + у(х) yo) 
И EGY aA aay л ( i 2) a 
which implies that a(x) and 8(x) are equivalent infinitesimals as x > xo. 9 

Example. The functions a(x) = x + 2x? and G(x) = x are infinitesimals 
as x — 0. Then the difference y(x) = 2x? is an infinitesimal of higher order 
than a(x) and 8(x) as x ^ 0 and, consequently, a(x) ~ B(x), x > 0. 

We leave it to the reader to solve the following problem. 

Problem. Let 6:(x), 82x), ... Bm(x) be infinitesimals of higher order 
than a(x) as x > хо. Prove that the sum o(x) = a(x) + Bi(x) + 8(х) +.. 
. + = х m(x) is an infinitesimal equivalent to a(x) as x > xo. 

Landau symbols. Let f(x) and (x) be defined in a neighbourhood Q 
of xo, except probably at x, and let ф(х) z 0 in a neighbourhood бо of 
Xo, X # Xo. (Notice that xo is a finite or infinite point.) 

We say that f(x) is an infinitesimal relative to ф(х) as x > xo and write 


f(x) = о(е(х)) if 


f(x) 
Бин ex) | 





In particular, f(x) = o(1), x ^ xo, means that f(x) is an infinitesimal 
as X хә. 

Examples. (1) x? = o(x), x ^ 0. 

(2) x = o(x?), x ^ œ. 

In general, x? = о(х®), x ^ +œ, a < В, and x? = o(x*), x ^ 0 + 0, 
а > В. 
5 We write f(x) = О(ф(х)), x ^ xo if there exist a number М > 0 and a 
neighbourhood бо of хо such that 


V| < М|) О0| vx€Q9o, x хо. 


Thus f(x) is bounded relative to (x). 

The notation f(x) = О(1), x ^ xo means that the function f(x) is bound- 
ed in a neighbourhood of xo. 

Examples. (1) x = O(x’), [1, +оо). 

(2) х? = O(x), [-2, 2]. 

(3) sinx = О(1), -о«х« +0, 
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Notice that the symbolic relations we have introduced are not equalities 
in the ordinary sense. For instance, the “equality” sin x = O(l), 
— Фо <x < +00, does not imply that O(1) = sin x. 

The following rules for the symbol “о” can easily be verified: 


(a) 0000) + of) = of), x > хо, x € Q5 
(5) o(/(x) - о(ф(х)) = o(fG0 - (х), x Хо, x €Q; 
(c) о(00/(х)) = of), x ж, хЄй. 


ча Indeed, let g(x) = o(f(x)) and р(х) = o(f(x)) Then (a) becomes 
gix) + 22(х) = о(Д(х)), x > хо since 


.. £(x)tgx) ., m) .. go _ 
шээг oe. ne Or 


It is also easy to verify the following rules: 
(d) oC) + OCO) = OU), x > ж; 
(e) o((x)) : О(е(х)) = OF) · eG), x > ж; 
(f) O(f G9) - Of) = OVX) - (х), x хо; 
(в) O(o(f(x)) = of), AOF) = 0009), x x. 
Recall that f(x) and ф(х) are equivalent or equal asymptotically as 
х — % and we write f(x) ~ ф(х), x > xo, if 
lim 20) = 1 


X — Xo e(x) Е 





Using the list of equivalent infinitesimals and Theorem 7.33, we obtain 
the asymptotic relations 
sinx = x + o(x), x ^6; 
e7 = 1 + x + o(x), x> 0; 
№ (1+ x) = x + o(x), x> 0; 
(1 + x)* = 1 + ax + o(x), x> 0. 


The relations of the form f(x) ~ ф(х), f(x) = o(e(x) and f(x) = 
O(¢(x)) are called the asymptotic formulas or asymptotic relations. 


7.9 Complex Numbers 


Representation of a complex number on a plane. A complex num- 
ber is a relation of the form 
Z=X+1y 
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where x and y are arbitrary real numbers and i| is the imaginary unity such 
that 2 = —1. 

The numbers x and y are called the real and imaginary parts of the 
complex number z = x + iy, respectively. We denote the real and imaginary 
parts of z by 


x=Rez апа у = Imz. 
The complex number x + /0 is thought of as identical to the real num- 
ber x. 
The complex numbers zı = xi + iy; and z = x» + iy; are said to be 
equal (zı = 22) if and only if xı = x» and yi = у». 
Operations on complex numbers. (a) Addition of complex numbers. 
The sum of zı = xi + iyı and z = ж + iy? is the complex number 


= +2 = (0 + x) + in + у»). 


The operation of addition of complex numbers obeys 
(1) the commutative law 


{1 + 2; = Ф@ + 4 
апа 
(2) the associative law 
(а + Z2) + = Zi + (4 +з). 


(b) Subtraction of complex numbers. For any complex numbers z and 
22 there exists the number z such that zı = z + z2. Then the number z is 
called the difference of zy and z and is denoted by 


Z= 41 — 2 = (№ — 0) + (у - y) 


(c) Multiplication of complex numbers. The product 2122 of the complex 
numbers 2 = xı + iyi and Ф = x» + іу is the complex number 


= 2122 = (105 — ny) + пу + xy). 


To memorize this formula it suffices to replace i? by — 1 in the product 
Ол + iy + iy). ' 

If zı and 2 are real numbers the operation of multiplication of complex 
numbers becomes identical to that of real numbers. 

It is easy to verify that multiplication of complex numbers obeys 

(1) the commutative law 


4122 = 2221; 
(2) the associative law 


(zz) = 21 (2223); 
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(3) the distributive law (with respect to the operation of addition of 
complex numbers) 
(a 2) = 4143 + 2243. 
(d) Division of complex numbers. For any complex numbers zı and 22 
(22 ; 0) there exists а number z such that 
Zi = 04. (*) 
The complex number z is called the quotient of zı and zz and is denoted 


by = =. We shall derive the quotient formula for zı and 2. 
2 
«< Let zi; — xi + iyi, @ = X2 + iy; and z = x + iy. Then from (ж) it fol- 


lows that 
Xi —Xx- yy and yı = юу + xy. 


This system of equations is solvable with respect to x and y provided that 
22 #0. We have 
Хю + уу . X291 — X12 


+1 » 
2 2 2 2 
X2 + № № + у 





4 = х + іу = 


The complex number 
z-x-iy 


is called the complex conjugate of the complex number z = x + iy and 
obeys the following rules: 





Z + 22 =Z tz, ZR = 4122, 


а) га а az 
2 2 2 2o 


zz = кр = х? T y 





Trigonometric and exponential forms of a complex number. The com- 
plex number z = x + iy can be represented on the xy-plane by the point 
M with coordinates (x, y) or by a position vector starting at O(0, 0) and 
ending at M(x, y) (Fig. 7.35). We shall call the xy-plane the complex plane. 
We also call the x-axis the real axis and the y-axis the imaginary axis. 

It is convenient to use the polar coordinates (r, 0), where r is the length 
of the vector OM and 0 is the angle between the vector OM and the X-axis, 
to locate the point M in the xy-plane (Fig. 7.36). In polar coordinates 


x=rcos6é, y=rsing. 
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Then the trigonometric form of a complex number is 
z=r(cos@ + i sin), z £0 

with the absolute value (modulus) of z 
r= = Nx «y! = у 20 

and the argument of z 
0 = Argz = argz + 2ka (k = 0, +1, +2, ...), 


where argz is the principal value of the argument. 





Fig. 7.35 Fig. 7.36 


The principal value of the argument of z is defined by 


-т< агр: < п (or 0 « argz < 27) 


ог Бу 
tan! for x » 0, 
x 
m+ dan 57 for x « 0 and y 2 0, 
argz — ят tan 7 for x « 0 and y « 0, 
> for x = 0 and y > 0, 
T 

-3 for x= 0 and y < 0. 


The argument of the complex number z = 0 is assumed to be undefined 
and the modulus of z = 0 is assumed to be zero. 

The complex numbers zı апа 22 which are distinct are equal to each 
other if and only if the moduli of zı and 2 are equal and the arguments 
of zı and zz are either equal or differ from each other by a multiple of 
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2т by a positive integer, i.e., 
lz1| || and Агра = Arg z + 2zn, 


where n is an integer. 
Example. Compute the modulus and argument of the complex number 


eg А т 
= о—5їп— —icos—. 
е sing соз 5 
ча We have 
>. T T 
x= -sing <0 and y= -cosg < 0. 


The principal value of the argument of z is 


2367 -1 mT). _ -1 БЭЕЭ 
arg 5 T 4 tan (coz) T + tan Ge =) | 


= 5 (77:23 EE EA 
= —T + tan (eA) = tag ру 8 
Hence 
Argz = -2 7 + 2km, k = 0, +1, +2, ... 


8 


| = (еіп? т + cos? т = 1. > 





Fig. 7.37 


The relationship between the complex numbers and vectors оп a plane 
enables us to interpret the operations of addition and subtraction of com- 
plex numbers as vector addition and subtraction as shown in Fig. 7.37. 
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It is easy to verify the following 
а + Z| < || + lal. 
а — 2| 2 14| - |z2l. 
When dealing with complex numbers it is helpful to use Euler's formula 
cos + i sinb = е'°, 
which enables us to present a complex number z in the exponential form as 


z= re'? 


It is advisable to present complex numbers in the trigonometric and 
exponential forms when we wish to multiply or divide complex numbers. 
If z = rie?! апа z = ње! then 
гүе eM) = грей! +02), 
44 Indeed 
аф = rie" геі = ricos 0, + i sin61)rz(cos 62 + i sin 6) 
= лт[со$ 0; cos 62 — sin 0; sin 02) + (sin 6, cos 6» + sin 0 cos 61)] 


101 


= ryro(cos(0; + 62) + i sin (01 + 62)) = rir; ей! * 82, 


Thus, multiplication of complex numbers involves multiplication of moduli 
and addition of arguments, viz., 
|Z1z2| = | || 


and 
агр (212) = аг 2; + аге 2. ® 


Analogously, the operation of division of complex numbers сап be 
described as 


iti 
zi ie i Qie1-82 





2 ne? р 
provided that г» z 0. Whence it follows that 


Zl 


22 


_ lal 
| 











апа 
24 
arg —— = arg 2 — arg 22. 
© 


Extracting a root of a complex number. We introduce the operation 
of raising the complex number z to the mth power as 


Ар А 


n times 
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Let us represent the complex number z in the exponential and trigono- 
metric forms as 


z = re/? = r(cos0 + i sin 0). 


Then we arrive at the formula 


z” = (re?) = re"? = r"(cos n6 + i sin n6). 


Setting r = 1, we obtain de Moivre's formula 
(cos + i sin 0)” = cos n0 + i sin n8. 


Based on this experience we can define the operation of extracting a 
root as follows. The complex number w is called the nth root of the complex 
number z if 


w^ = 2. 


In symbols, we write w — Vz. : 
We shall show that for any z # 0 the root Vz assumes п distinct values. 
Substituting 


z= ге! and w = oe/? 
Р n a 
into w = Vz, we obtain 
pře”? = rei?. 


Recall that given two equal complex numbers the moduli of these num- 
bers are equal, and the arguments are either equal or differ by a positive 
multiple of 27. Then, from the above identity it follows that 


o" =r and no = 0 + 2kr 


or 


e = Wand o = LT, (*) 


Thus, the moduli of all the nth roots of z-are equal and their arguments 
differ by the multiples of 2x/n by positive integers. Whence it follows that 
the nth roots can be displayed on the complex plane as the vertices of the 
regular polygon of л sides inscribed in the circle with the radius Viz] and 
centre at the point z = 0 (Fig. 7.38). 

Substituting k = 0,1, 2, ..., n — 1 into (*), we get n distinct complex 
numbers 


Vn = Vr (cos Sd Lnd + 276 + isin ELLE + ont ) А 


К = 0, 1, 2, ..., п - 1 
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Е " j 9*2 
М = Уе ” ,k-20,12,..,n-L (ux) 


3r 
Example. Compute all the values of Vi. 
ч Writing dówn 2 =i in the exponential form, we have 
iT 
1-1-6 2, 


Then we obtain from (**) 


i(E +284) 
мик = е 3’, k=0, 1, 2. 








M TOR 6 г. 

dis 7 6 6 

= al ordo. Ap 
eg ид 
m 3r... 3r 

w,-2e 2 = cos —- + isin- = i 


(see Fig. 7.39). » 


Limits of sequences of complex numbers. Let {Zn} be a sequence of 
complex numbers, where Zn = x, + İyn. A complex number z is a limit of 
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{Zn} if given any є > 0 there exists a number № = Me) such that 
zn- z| «e 


whenever n z N. 
A sequence {Z4} is called convergent to a number z if z is the limit 
of (z,]. In symbols, we write 
z = lim Zn or Zn >Z (n > оо) 


п о 
to mean that z is a limit of z, ОГ Zn converges to 2, or, using logical symbols, 


(lim Zn = z)@ (We >0 ЗУ vn2N,|z-zli«e) 


Every term Zn = X, + iy, Of (z4] is associated with the real numbers 
Xn and y, Hence the sequence (z,} of complex numbers corresponds to 
two sequences {хл} and [у„) of real numbers comprising the real and im- 
aginary parts of terms z,, respectively. 

Theorem 7.34. A sequence (z,} of complex numbers Zn = Xn + iyn is 
convergent if and only if both the sequences {xn} and {yn} of real numbers 
are convergent. 

* let {Zn} converge to a complex number z. This means that for any 

є > 0 there exists N such that |z, — z| < = whenever n > М. Since 

lx. — x| < |4 – z| < € and |yn ^ y| < |4 —z| < € then lim x, = x and 
noo 

lim y, = y. Conversely, if lim x, = x and lim y, = y then for any є > 0 


п со n-*o nd 


there exists Nj such that 
E 
lx. — x| < ~= and |yn – 


E 
үз JS ag 
Thus 
|4 — z| = (Xn + iyn) — & + iy) = V Gs = х?) + (ул yÈ «t. 


Whence шэ Xn + л) = х + іу. > 





Ву ас of this theorem all results concerning sequences of real num- 
bers are fully applicable to sequences of complex numbers. 


Exercises 


1. Prove that the limit of (5) is equal to zero. Verify what n 
n 


satisfy the inequality = < є provided that (a) any є > 0, (b) є = 0.1 and 
n 
(c) e = 0.01. 





2. Prove that the limit of { т 1 | is equal to 1. 
n 
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Compute the following limits 





2 3 2 2 
3. lim 4) a4. lim сл. 5. lim de 
nc n лэю nî + 16п + 2 л-ю» 2п +6 
eu V2 -п-2 ded Mmi +6 -n 
Цай nl лээ JA -н1 XVm +1 
1+ Vn? +2 
8. lim 





йт т 43534045 


(Hint. On evaluating a limit of a quotient of polynomials it is helpful to 
divide the nominator and denominator by п” where p is the greatest expo- 
nent in the denominator. This approach is also used when we evaluate limits 
of fractions involving irrational functions.) 

Compute the following limits 











n! : 1 1 1 
БЕБЕ 1. S RISO ЗЕ : 
езт ане 0 4 x) 
11. lim do4242 n) 12. lim MM, 13. lim (Vn +2 - 
noo n л- о 
Vn. 14 lim(Vm-4n«5 – п). 15. lim n(Wn? +1 - n). 
noo now 
2 2 3 2 
= = : + 3x" + 2 
Та qup oec d те gue xot e 
х-1 x —x x2 -1 X + 3х + 2 x2-2 X -х- 6 
2 
М — 1 
19. lim Ук 1 . 20. lim Z 21. lim Mgr 
х0 x x1 — 49 x0 x 
3 КСЕ 
1d 205 8 3 qa ce 
xo х“ 3х + 1 x^v 1 4 x^ + 3x 


Using equivalent infinitesimals, compute the following limits 

















tan 3x 
24. lim E 25. lim GE (а, В are constants). 26. lim X 
Ql -1 : YA 
jp go Л ae opp HE 26 зөлрөү PH. dg t 
хэд х x20  Sin3x x2« Sin2x x21 SIN Tx 
2 . linx- 2.24 
М а. зах. шил d. ЭХ. quu ОЕ 
x20  Sin*3x roe X-—e х0 х 
ee . lnü-»x») QU et Ls abt 
34. lim . 35. lim — ————. 36. lim ———— —- (a, b are 
2 sin? x roo е*“^— 1 x70 х ( 


constants). 37. lim 1 


xo 


(er ME 1). 
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Compute the limits 


x 3x I 
38. lim (—*_). 39. tim (LY . 40. tim osx)”. 
x>w 1 + x х» x = 2 x70 


2 x 
41. lim (түт | 
xo \2x + 1 
42. Compute the modulus and the principal value of the argument of the 
following: 
(a) 4 + 3i, (b —2 + 2131; (с) -7- i; 
(d) coz + ising ; (e) 4 - 3i. 
43. Write down complex numbers in the trigonometric and exponential 
forms: 
(a) —2; (b) 25 (с) - 5 (d) — V2 + i v2. 
44. Compute the following: or 
(a) (2 — 207; (b) (3 — 305 © 5 - 7) . 


1+1 





45. Compute all the values of the roots: 


(a) УЛ; (5) УБ (с) У-1: (4) YT FT; (е) V2 - 237. 


Answers 
"II > 4; (c) n > 10. 3. 1. 4. 0, 5. œ. 6. V2. 7. —1. 8. 0. 9. 0. 10. 1. 
€ 

11. 1/2. 12. 0. 13. 0. 14. —2. 15. 1/2. 16. 0. 17. —2. 18. —2/5. 19. 0. 20. —1/56. 21. 1/3. 
22. 0. 23. — 2/3. 24. 2. 25. «/8. 26. 3/2. 27. 3. 28. 2/3. 29. —3/2 (put т — x = t). 30. 2/т. 
31. 1/9. 32. 1/e. 33. —1/2. 34. 1. 35. —1. 36. a — b. 37. 1. 38. 1/е. 39. e°. 40. e`". 


41. 0. 42. (а) r = 5, 0 = tan ^ ! 3/4; булав 27, (с) r = 5V2, 0 = -т + tani; 


1.(an»- 


(d r=1, Ө 23 (е) r=5, 0- -tant $, 43. (a) 2(соѕт + isin т), 2e'7; 


ij -8 
(b) 2 cos Z «isin^ ,2e ; (c) cos -3 } +isiaf -2 ‚е ;(d2 блр 
2 2 2 2 4 


2. 3r Ay : : 1 А 
pui ,2e . 44. (a) 2!°1 + 0; (b) 1728; (с) 1. 45. (а) +1, +i; (b) Es (1+ 


3 
(c) x cost — i sin Z E ih dac ; (d) M4 a + i), 7 — cos —+ 
8 8 8 8 2 12 


i sin — J, ¥2( sin Z- — icos— ); (+03 - 2. 
12 12 12 


Chapter 8 


Differential Calculus. 
Functions of One Variable 


8.1 Derivatives and Differentials 


The derivative. Let y — f(x) be a function defined on an open inter- 
val (a, b) and let x be a point in (a, b). Consider a point x + Ax, where 
Ax is a positive or negative increment of the variable x such that the point 
X + Ax is contained in (a, b). The increment Ax will produce an increment 
Ay in y = f(x) so that 


Ay = f(x + Ax) — f). 
We can write a ratio of two increments Ay and Ax as 


Ay _ f(x + Ax) - fo) 
ALT re (Ax z 0). 





For a given x this ratio becomes a function of Ax, i.e., 
Ay 
— = e(Ax). 
АХ e(Ax) 

Definition. 1f there exists a limit of the ratio РА as Ах > 0 this limit _ 
is called the derivative of the function у = f(x) at the point x and is denoted 


by f'(x) or y’ (x) or уу. 
Thus by definition we have 





, . Ay . f(x + Ax) — f(x 
X) = lim = = lim —— in * 
о) Ax20ÀX дх-0 Ах e) 
Examples. (1) Let y = x?. Then for any x and any Ax we have Ay = 
(x + Дх)? — x? = 2xAx + (Ах). Whence M = 2x + Ax and lim 2) = 
Ax—0 


2x. By definition lim T = y' (x). Hence у = x? has the derivative 
4x70 


y’ = 2х, ie, (х2)' = 2x at every point x. 


(2) Let y = e*. Then for апу x and any Ax we have Ay = e** ^^ — е" = 

ЗЭР: wc egt lp а алеи 

e*(e4* — 1). Whence lim = = lim 2—— ——— = ех lim £—.—— = ех 
( ) Ax20ÀX — axo Ax Ax—0 


Thus (e*)’ = e* for all x. 


20— 9505 


306 8. Differential Calculus. Functions of One Variable 








Remark. Sometimes it is convenient to express (*) in the following form. 
Let f(x) be defined at xo and in a neighbourhood of xo. Then 


7 . f(x) — fo) 
f (ro) um X — Xo 
provided that this limit exists. 

We say that the function f(x) has a derivative on (a, b) if this derivative 
f’ (x) exists at each point x € (a, b). 

We leave it to the reader to solve the following problems by applying 
the definition of the derivative given above. 

Problems. (1) Using the definition of the derivative find f’ (0) for the 
function 


х? шү fo хж0, 


70) = 0 for x=0. 

(2) Let f(x) be a periodic function with period 7: Prove that if f(x) has 
a derivative then the derivative of f(x) is also a periodic function with peri- 
od Т. 

(3) Prove that the derivative of an even function is also an even function 
and the derivative of an odd function is an odd function provided that 
these derivatives exist. 

Geometric interpretation of the derivative. We consider the graph of 
a function y = f(x) which is defined on (a, b) (Fig. 8.1) and choose the 
points M(x, /(х)) and Р(х + Ax, f(x + Ax)) on the curve у = f(x). We also 
draw the line through the points M and P. 

Suppose that the point P is moving along the curve y = f(x) towards 
the point M (or, which is the same, Ax tends to zero) so that the line MP 
is changing its position until MP coincides with the line MT. The line MT 
which defines the limiting position of MP as Ax > 0 is called the tangent 
to the curve y = f(x) at the point M. Notice that when the point P moves 
to the point M the angle TMP tends to zero. 

From Fig. 8.1 it is easily seen that the slope k of the line MP is 


Ay 
Ау” 

Let ф # 7/2 be the angle between the tangent МТ and the x-axis. Since 
the slope of the tangent MT to the curve y = f(x) at M is the limit of the 
slope of MP as the point P moves to the point M along the curve and, 
consequently, as Ax > 0, we get 


К = їап а = 





tan e = lim tan o = іт Ау _ lim 
Р-М Ах-ОДХ дх-0 


Лох + Ax) - Д) 
ХҮ . 
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The latter limit, if it exists, is the derivative f’ (x) and is equal to tan е. 
Therefore, the derivative f’ (x) of the function y = f(x) is the slope of the 
tangent to the curve y — f(x) at the point specified by the abscissa x. 





Frx*az,f(x*Ax]) 





r 


Fig. 8.1 


Equations of tangents and normals to a curve. Let a curve be defined 
by a function y = f(x) and let Мо(хо, f(xo)) be a point on the curve. We 
assume that f(x) has a derivative at хо and derive the equation of the tan- 
gent to the curve at Mo. 

The equation of the line through the point Mo(xo, yo) is 


y — Yo = k(x — xo), 


where К is the slope of the line. 
The slope k, of the tangent to the curve y = f(x) at Mo is equal to the 
derivative f’ (xo) so that the equation of the tangent takes the form 


Y — Yo = Л (Xox — xo) Wo = f(xo)). 


The normal to the curve at a given point is the line which passes through 
this point and is perpendicular to the tangent to the curve at this point. 
This implies that the slope kn of the normal is related to the slope k, of 
the tangent as 


kn = — — Or kn = — l 


€ g 0 
ie fo 


20* 
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so that the equation of the normal to the curve y = f(x) at Мо(хо, Yo) is 


1 
Л О) 
When /’ (хо) = 0 the equation of the normal becomes x = Хо. 
Example. Write down the equations of the tangent and normal to the 
curve y = x? at the point О(0, 0). 
< We have f(x) = x’, f' (x) = 2x and f’ (0) = 0. Then the equation of the 
tangent is y — 0 = O(x — 0) or y = 0, ie, the tangent coincides with the 
x-axis, and the equation of the normal is x = 0, i.e., the normal coincides 
with the y-axis (Fig. 8.2). > 


"| 





у— № = (x — хо) C” (xo) # 0). 


у=х? 





м 
: 








Fig. 82 Fig. 8.3 


Application of the derivative in mechanics. Let s = s(t) be an equation 
(a law) of rectilinear motion of a point, which specifies the distance 
travelled by the point as a function of time f. Let As be the distance travelled 
by the point during the time interval Ar from Г to £ + Af, i.e., 


As = s(t + Al) — s(t). 


The ratio As/At is called the mean velocity in the time interval Ar. We 
define the velocity v of the point at the moment / as the limit of the mean 
velocity in the time interval At as At > 0 and write 


: AS 
v(f) = lim == = s'(t). 
(2 im (0) 
Hence the velocity v(f) is equal to the derivative of the distance s with 
respect to time £, ie, u(t) = s’ (£). 
Example. Let us consider the law of rectilinear motion s = £? where 


the distance 5 is measured in metres and the time / is measured in seconds 
and compute the velocity at ¢ = 3s. | 
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ча The velocity of a point at any / is given by 


_ 45 _ 
v= ар 21 
Whence we have v = 6m/sat / = 3 5. > 


Right-hand and left-hand derivatives. The right-hand derivative 
f'(x + 0) of the function y = f(x) at a point x is 


Р . Ay 2 Ay 

x+ 0) = lim = = lim = 

” ) ах-ОДХ дх-0+0АХ 

Ах> 0 
and the left-hand derivative of у = f(x) at x is 

2 . Ay . Ay 

х- 0) = lim = = lim = 

ft ) Ах-ОДЛХ ar+0-0 AX 


Ах<0 


provided the limits involved exist. 

It is easy to see that for the derivative f' (x) to exist at a point x it 
is necessary and sufficient that the function y = f(x) have the right-hand 
and left-hand derivatives at x and these derivatives be equal so that 


J œ + 0) = f'(x — 0) = f' Q9. 


By way of illustration we shall show that there exist functions which 
have right-hand and left-hand derivatives at x but have no derivative at 
x. Consider the function f(x) = 1х1. The ratio 


ЛО + Дх) — f(0)  lAxI 
Ax ~ Ax 





is equal to 1 for Ax > 0 and is equal to —1 for Ax < 0. Hence f(x) = 


Ix! has the right-hand derivative f’ (0 + 0) = lim et = | and the left- 
Ax0 


Ax»0 


hand derivative f’ (0 — 0) = lim Тах = —]at x = 0 but these derivatives 
4x70 


Ax«0 
are distinct. Consequently, f(x) = 1х! has no derivative at x = 0. Geometri- 
cally this means that there is no tangent to the curve y = 1х! at the point 
O(0, 0) (Fig. 8.3). 
Let f(x) be continuous at a point xo. We say that f(x) has the infinite 
derivative equal to + о or to —© at xo if at this point 


, im 27 ; . Ay 
Xo) = lim — = +œ or хо) = lim “2 = — eo, 
Гоо) дх-о АХ Л бо) Ax 0 AX 


respectively. 
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This implies that the tangent to the curve y — f(x) at the point 
(xc, f(xo)) is perpendicular to the x-axis. For example, if f(x) = Vx then 
at x = 0 we have 


Ay _ ЛО + A)-f() VAx 1 





Ax Ax Ax V (ax)? : 
Whence it is easy to see that Ay/Ax tends to +œ as Ax tends to zero in 


an arbitrary manner. The tangent to the curve y = Ух at the point 
ООО, 0) coincides with the y-axis (Fig. 8.4). 








ні 





Fig. 8.4 


ч 





Mol xo, / xg) 








О Хо r 
Fig. 8.5 


Thus, if a function f(x) has a finite derivative at xo then there exists 
a tangent to the graph of y = f(x) at Мо(хо, f(xo)) (Fig. 8.5) and this tangent 
is given by the equation 


y — fGxo) = f' (xo) (x — xo). 
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y=f(x) 


My (2s, f (Xp) 





Fig. 8.6 





Fig. 8.7 
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We say that a function f(x) is smooth on an open interval (a, Б) if f(x) 
and its derivative are continuous on (a, b). A curve corresponding to a 
smooth function is also called smooth on (a, b). 

If a function y = f(x) is continuous at xo and has the right- and left- 
hand derivatives f’ (xo + 0) and f’ (xo — 0) and if f” (xo + 0) z f' (xo — 0), 
then at the point Мо(хо, f(xo)) there exists no tangent to the curve y = f(x). 
In this case the curve is not smooth and at Мо(хо, /(хо)) there exist two 
lines one of which is a tangent to the left branch of the curve and the 
other is a tangent to the right branch. The point Мо(хо, f(xo)) is said to 
be the corner point of the curve (Fig. 8.6). Notice that the point O(0, 0) 
is the corner point of the function (curve) у = Ixl. 

If a function f(x) is continuous at xo and its derivative is infinite at 
Xo the following cases should be distinguished 

(1) f’ (xo) = + =; 

(2) f’ (xo) = — ©; 

(3) f'(xo – 0) = – ә, Ј' (х + 0) = +0; 

(4) f'(xo – 0) = +o, f'(xo + 0) = – ә. 

Figure 8.7 displays the graphs of у = f(x) and tangents х = xo correspond- 
ing to cases (1)-(4). 

Differentiable functions. Let y = f(x) be defined on an open interval 
(a, b) and let x be a point in (a, b). Consider an increment Ax in x such 
that the point x + Ax is contained in (a, b). The increment Ax produces 
the increment Ay in f(x) so that 


Ay = f(x + Ax) — fw). 
The function f(x) is called differentiable at the point x € (a, b) if the 
increment of f(x) 
Ay = f(x + Ax) — ЈО) 
corresponding to the increment Ax admits a representation of the form 


Ay = A Ax + a(Ax) Ax, 


where A is a number independent of Ax (though in general A depends on 
x) and a(Ax) > 0 as Ax 0. 
Example. Let y — x?. For any x and.any Ax we have 
Ay = (x + Ax — х? = 2x Ax + Ax Ax. 
A a 


By definition this means that y = x? is differentiable at any point x and 
A = 2x, a(Ax) = Ax. 

The following theorem specifies the necessary and sufficient conditions 
for a function to be differentiable. 
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Theorem 8.1. For a function у — f(x) to Бе differentiable at any point 
x it is necessary and sufficient that f(x) has a finite derivative f’ (x) at x. 
ча Necessity. Let f(x) be differentiable at x. We prove that there exists a 
derivative f’ (x) at x. Indeed, since y = f(x) is differentiable at x the incre- 
ment Ax in x gives rise to the increment Ay which can be expressed as 


Ay = А Ax + a(Ax) Ax. 
Whence 


АУ = A + о(Ах), 


where A is independent of Ax, ie, is constant at a given point x and 
a(Ax) ^ 0 as Ax > 0. 
Theorem 7.19 implies that 


- Ay 2 
A = lim = = f' (x). 
ШЙ А Го) 
Thus a derivative of f(x) at x does exist. 
Sufficiency. Let f(x) have a finite derivative f' (x) at x. We prove that 
(х) is differentiable at x. Indeed, since f” (x) exists at x there exists a limit 


of АУ as Ax > 0 so that 


. Ду ; 
іт — = ; 
c 


Whence, by virtue of Theorem 7.19 it follows that 
АУ = /' 09 + alx), 


where a(Ax) ^ 0 as Ax — 0 and, consequently, 
Ay = f' (x) Ax + a(Ax) Ax. (*) 


Since f’ (x) is independent of Ax and a(Ax) > 0 as Ax > 0 from (*) we 
infer that y = f(x) is differentiable at x. > 

Theorem 8.1 establishes a one-to-one correspondence between the no- 
tion of a function differentiable at a point and the notion of a function 
having a finite derivative at this point, that is, if f(x) is differentiable at 
a point then it has a finite derivative at this point and vice versa. So the 
operation of computing a derivative of a function is also called the differen- 
tiation of a function. 

Continuity of differentiable functions. We shall prove the theorem 
which establishes a relationship between continuous functions and differen- 
tiable functions. 
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Theorem 8.2. If a function y = f(x) is differentiable at a point x then 
f(x) is continuous at x. 
~ Indeed, if f(x) is differentiable at x then the increment Ay corresponding 
to the increment Ax in x admits a representation of the form 


Ay = A Ax + a(Ax) Ax, 
where A is constant at x and a(Ax) ^ 0 as Ax ^ 0. Whence lim Ay = 0 
4x70 


so that f(x) is continuous at х. > 

The converse is not true, namely, if f(x) is continuous at x it is not 
necessarily differentiable at x. For example, f(x) = |x| is continuous at 
x = 0 but has no derivative at x = 0. Hence, f(x) is not differentiable at 
x = 0. 

Example. The function 


xsinl for x #0, 


fe) = 0 for x=0 


is continuous on (— œ, + oo). For all х #0 f(x) has a derivative; however 
J(x) has neither right- nor left-hand derivatives at x = 0 since 


Ax sin i5 1 
— sin 
Ax Ax 
has no limit as Ax > 0 + 0 and as Ax 0 — 0. 

The examples we have considered involve functions each of which has 
no derivative only at one point in its domain. The idea that a function 
may have no derivatives only at a finite number of points prevailed in the 
eighteenth and early nineteenth centuries. However, later on various emi- 
nent mathematicians offered examples of functions continuous on a closed 
interval [a, b] and having no derivatives at any point of [a, b]. 





The differential. Let y — f(x) be differentiable at a point x, i.e., let an 
increment Ax in x produce an increment Ay in f(x) so that 


Ay = A Ax + a(Ax) Ax, 


where a(Ax) > 0 as Ax > 0. 

If y — f(x) is differentiable at x the linear part A Ax of Ay, provided 
that A z 0, is called the differential of y = f(x) and is denoted by dy or 
by df(x) so that 


dy=A Ax. 
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When A z 0 we say that A Ax is the principal linear part of Ay since 
a (Ax) Ax is an infinitesimal of higher order than А Ax as Ax ^ 0. If A = 0 
the differential is equal to zero. 

By virtue of Theorem 8.1 we have A = f’ (x). Then the differential dy 
becomes 


dy = f' (x) Ax. 


Along with the notion of the differential of a function we can introduce 
the notion of the differential of an independent variable x by putting 
dx = Ax. Then the differential of y = f(x) can be written as 


dy = f' (x) dx. 
Whence we have the Leibniz notation for the derivative: f' (x) = 2 . Thus 
a derivative of f(x) can also be thought of as a quotient of two differentials 
dy and dx. 
We say that a function y = f(x) is differentiable on an open interval 
(a, b) if f(x) is differentiable at every point of (a, b). 





Fig. 8.8 


To interpret the notion of a differential geometrically we use Fig. 8.8. 
Let a function у = f(x) be differentiable on some open interval (а, b). We 
draw a tangent to the curve f(x) at a point M and choose a point М, whose 
abscissa is x + dx. Clearly, f' (х) = tan e. Consider the triangle MPQ. It 
is easy to see that 


РО = MP tan e = f' (x)dx = dy. 
Thus, the differential dy = f’ (x) dx of the function y = f(x) is the incre- 


ment of the ordinate of the tangent to the curve y — f(x) at M when x 
is given the increment dx. 
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82 Differentiation Rules 

The derivative of a constant function. If y — C — const at every 
point of (a, b) then y’ = 0 in (a, D). 
48 Indeed, if y= C vx (a, Б) then for any x € (a, Б) and any Ax such 
that x + Ах є (а, b) we have 


Ау-С-С-0 and ^7=0 (Axs0) 


Ax 
Whence 
. Ay 
= lim = = 0 vx € (a, b). 
pum (a, b) 
Thus 


(C)’=0 апа dC=0. » 
The derivative of a sum of functions. Let u(x) and v(x) be differentiable 
at x. Then the sum y(x) = u(x) + v(x) is also differentiable at x and 
y'Q) = (u(x) + vd)" = и'(х) + v'Q9. 
* Indeed, the increment Ax in x produces the increments in u(x) and in 
u(x) so that 
u(x + Ax) = u(x) + Au апа v(x + Ax) = у(х) + Av. 
Then the increment Ay in y = u(x) + u(x) becomes 
Ay = Ки + Au) + (v + Av)] — (и + v) = Au + Av. 


Whence 
Ay _ Au 4 Av. 


Ax Ах Ax ө) 


Since u(x) and v(x) are differentiable at x there exist the derivatives 
u' (x) and v'(x) at x. Then each summand in (*) has a limit as Ax > 0 
so that there exists a limit of the right-hand side of (*) equal to u’ (x) + 
v ' (x). Hence, the left-hand side of (*) also has a limit as Ax > 0. In other 
words, there exists 


. Ay 
lim —— = y’ (x). 
Ла у (х) 
Thus, evaluating the limit of (ж) as Ax > 0, we obtain 


уб) = (и(х) + v6)! = u w) 40703) 
and 
d(u + v) = du + dv. > 
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Analogously, we prove that 


(u(x) — v(x))’ = и'(х)— v'() 
and 
d(u — v) = du — dv. 


We can easily extend these results to any finite number of differentiable 
functions. 
Example. Find the derivative of y = x? + e* + 2. 


я y-(«e«2'-Q) +(е)' + (2), =2х+е » 
The derivative of product of functions (product rule). If u(x) and v(x) 
are differentiable at x so is the product u(x)v(x) and 
(u(x) v(x)" = u'GOv(x) + и(х)о' (х). 
< Let Ax be an increment in x. Then u(x) and v(x) receive the increments 
Au and Av which give rise to the increment 
Ay = (и + Au)(v + Av) — uv = v Au + и Av + Au Av. 


Consider the ratio 


Ay | Au Av Av 
Ax АХ АС ^ АУ (*) 





We prove that each factor in the right-hand side of (*) has a limit as 
Ax > 0. Indeed, at a given point x u(x) and v(x) are constant. Since u(x) 
and v(x) are differentiable at x there exist the derivatives 


. Au . Av 
(х) = == d uv’) = lim —. 
u’ (x) Дип апа v'(x) mE 


Since u(x) and v(x) are differentiable at x they are continuous at x so that 
Au and Av tend to zero as Ax > 0. Thus the right-hand side of (*) has 
a limit equal to v(x)u' (x) + u(x)v' (x) as Ax > 0. Hence there exists a limit 
of the left-hand side of (*), ie, there exists lim АУ = y' (x). Evaluating 
Ах-0 
the limit of (*) as Ах- 0, we obtain ` 
J'(x) = (uG)vGQ9)' = и’ (х)о(х) + и(х)»' 00. 
Whence 
d(uv) = vdu + udv. > 


Example. Find the derivative of y = (х? — 1)(e* + 2). 


я y’ = (2 — 1)(е* + DY = ê – 1)'(е* + 2) + (х2 — 1)(е* + 2) 
= 2х(е* + 2) + (х2 — De = (x? + 2х – 1)е* + 4х. > 
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Corollary. If a function is multiplied by a constant factor its derivative 
(or differential) is multiplied by this factor, i.e., 


(Cu(x)) = Cu'(x) and d(Cu(x)- Саи (C = const). 
The product rule is easily generalized to any finite number of differen- 
tiable functions so that 
(u10)ux(x) ... иһ(Х))' = ui(x)uz(x) ... Unt) 
+ uux) ... ил(Х) +... + min) ... usr). 
The derivative of a quotient of functions (quotient rule). If u(x) and 


v(x) are differentiable at a point x and if v(x) z 0 at x then the quotient 


у(х) = uo) is differentiable at x and 


u(x) 
7 ux u'v — uv’ 
4 7 ( ) E 2 1 
у у 


и аи – ud 
a (4) -” Е E 


у у 





(v(x) # 0). 





ча Since v(x) is differentiable at х, v(x) is continuous at x and, by virtue 
of Theorem 7.22, v(x + Ax) # 0 for all sufficiently small lAxl. Then the 
ratio 


u(x + Ax) _ u + Au 
р(х + Ах) vu + Av 





is defined for all sufficiently small | Axl. 
The increment Ax in x produces the increment 


и + Ди и оДи- иду 








A = = 
á v+ Ду о v? + v Av 
Whence 
v Au — u Av 
Ay Ax Ax | (+) 


Recall that u(x) and v(x) are differentiable at x. This implies that there 

exist lim Ай: = u’(x) and lim avs v'(x) and Av > 0 as Ax > 0. Also, 
Aro АХ ar+o0 AX 

at a given point x the values of и and v are constant and, as stated, v(x) = 0. 


The right-hand side of (+) thus has a limit as Ax > 0 equal to Зи 00. = HU : 
v 
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Hence there exists a limit on the left of (ж), i.e, there exists lim x = y’(x). 
Ах-0 


Evaluating the limit of (5) as Ax > 0, we obtain 


you (sy = w Ow -uo 











v(x) v^(x) 
(v(x) = 0). > 
d (3) _ vdu ий ‚ 
у U 
Example. Find the derivative of y — = = 
х + 5 
, fe-1 (е-ро? + 5) = ("= Dee + 5)’ 
Б; шы Е ares 
ХА +5 (x? + 5)? 


_ еҷ? + 5) = (е — 1)2х _ (х? – 2x + 5)е* + 2x 


(x? + 5)? (х? + 5)? 2 





Problems. (1) Determine whether f(x) + ф(х) is differentiable at a point 
x or not, provided that f(x) is differentiable at x and (x) is not. 

(2) Suppose that f(x) is differentiable at xo, f(x) = 0, and ф(х) is not 
differentiable at xo. Prove that /(х) ф(х) is not differentiable at xo. 

(3) Suppose that /(х) and e(x) have no derivatives at xo. Does it imply 


that (a) f(x) + (0, (b) Ло) ф(х) and (c) Л E have no derivatives at xo? 


(Consider the following functions: (i) f(x) = Ixl, ф(х) = – 1х1, ж = 0; 
(ii) SO) = ф(х) = Ixl, xo = 0; (ii) f(x) = eœ) = 1х1 + 1, xo = 0) 

Derivatives of some elementary functions. (1) Exponential function 
y = а (a > 0,а z 1). This function is defined at every point of the number 
line. Hence, for any x and any Ax we have 











Ay EG qe а a Z a*(a™ 2 1) 
and, for Ax z 0, 
Ay. gra - d 
Ax Ах? 
s Ay : x ах -1 X qu a^* — 1 x 
lim — = lima = q* lim = q* lna. 
Ax20ÀX — Ax—0 Ax ах-0 AX 


Thus 
(a*)’ = а In a. 
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In particular, if a = e then 
(e*)’ I 


(2) Logarithmic function y = ln x (x > 0). For any x > 0 and any Ах 
such that x + Ax > 0 we have 


= In@ + Ах) – In x = In (1+ 2). 














Whence 
Ax 
Ay _ In (1 + 2) 
Ах Ах 
and 
(io) „=, 
lim £2 = lim EJ mote. 
АХ-0 x0 Ax Ax 0 АХ x 
Thus 
(In x)’ = — 
Using the identity 
log; x = loga e Inx (а> 0, a# 1) 
we arrive at 
; , lge 1! 
(log, x)’ = log, e (In x)’ = volcan 


and, finally, 
is 
(ова x) ~ xlna’ 

(3) Power function у = x” (а is an arbitrary real number). This function 
is defined at least for all x > 0. Then 


Ау = (х + Ах) – х" = х" (1-2) -1 


and 
Ax а 
ГЭВ” tx 1 
Ax 
Recall that (1+ a -1 ~a as Ax > 0 (see Chap. 7). Then 
Ах Үү? a Ax 
Ay Тр — 1 zi 


2 4 „1; x a 
lim — = x“ lim = x“ lim —— = ax 
Ax 0 AX АХ-0 Ах-д0 Ах 
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and 
(x?) pem ах“ - 1: 


(4) Trigonometric functions. Consider у = sin x, — oo < x < +œ. For 
any x and any Ax 


Ay = sin(x + Ax) — sin x = 2 sin ^* cos («+ 3) 


and 





Since y=cosx is a continuous function at any point х, 








. Ах 
АЎ sin — 
lim cos (« + 2) = cos x and lim = 1, we obtain 
Ax 0 2 ах-о AX 
2 
sin — 
. Ay : 2 Ах 
lim — = lim cos [x + = — COS X. 
Ах-0 Ах-0 Ax ( 2 ) 
2 
Thus 
(sin x)’ = cos x. 
Analogously 


(cos x)’ = —sin x. 


Using formulas for derivatives of sin x and cos x, we easily arrive at 


= sec? x 








in * sin x)’ cos x — sin x(cos x)’ 1 
бап х)’ = (E) = ES COS > ( цан 2 
COS X cos” X cos x 


so that 


(tan x)’ = L = sec? x (eim. n — 0, +1, +2, 2 
cos? x 2 





Analogously 


(cot х)’ = — — L = —cosec?x (x # пт, п = 0, +1, +2, ...). 
sin^ x 





21—9505 
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83 Differentiation of Composite and Inverse Functions 


Derivatives of composite functions. First we shall prove the follow- 
ing important theorem. 
Theorem 8.3 (chain rule). Let и = ф(х) be differentiable at хо and let 
y = f(u) be differentiable at uo = ф(хо). Then the composite function 
y = fle] is differentiable at хо and 


{Ale QM] J la - x = f’ (40) e (хо). 


«1 Following our usual approach we consider the increment Au in и = ф(х) 
corresponding to the increment Ax in xo. The increment Au produces the 
increment Ay in y = f(u). 
Since y = f(u) is differentiable at ир we can write 
Ay = f' (uo) Au + a(Au)Au, (8) 
where a(Au) > 0 as Au > 0. 
The function a(Au) is indeterminate as Au = 0. If we set a(0) = 0, 
a(Au) becomes continuous as Au = 0. 
Dividing both sides of (») by Ax (Ax = 0), we obtain 
A , А 
АУ = /' (ш) 28 + alan) 25. (9) 
Also, и = ф(х) is differentiable at хо and, consequently, continuous at 
Xo. Hence, Au — 0 as Ax > 0. This implies that «(Аи) tends to zero and 
ац > ф (хо) as Ax > 0. Therefore the right-hand side of (**) has a limit 
equal to f' (uo) e' (xo) as Ax > 0. So there exists a limit of the left-hand 
side of (жж), i.e, lim T , which is the derivative of the composite function 
ах-0 


y = flex] at хо with respect to the variable x. 
Evaluating the limit of (*««) as Ax > 0, we obtain 


(Ie 091) rl«- x. = f’ (40) e Оо), 


where f’ (uo) designates the derivative of f(u) with respect to the variable 
и at the point uo = (Хо) corresponding to the point хо of the variable х. 
This identity can be written as 


dy _ dy du 


dx du dx 
or 


Ух = Yuly. В» 
Examples. (1) Find the derivative of у = e? * 
-«4 Here y is a composite function of x which can be expressed as у = е“, 
where u(x) = sin x. Then 
Ye = (eiu = е“ cos х= e 


sinx 


cos x. В 


8.3 Differentiation of Composite and Inverse Functions 323 


(2) Find the derivative of y = In Ixl, x z 0. 
ча This function is even and is defined at every point of the number line 
except at x = 0. If x > 0 then lx! = x and In Ix! = In x so that 


yx = (In x)’ =i (x > 0). 
If x « 0 then Ixl = —x and In Ixl = In (— x). 
Set у = In и апа u = —x. Then y = In (- х) becomes a composite 
function. By the chain rule we have 
кз уйуй bes y ol 
Ух = уши, =; CDs; CDs 
so that for x « 0 у; -1 
Thus 
(п Ix!) =} (х з 0). > 


Remark. Тһе chain гше is also applicable when we consider апу finite 
chain of functions. For example, if y = f(u), и = e(z) and z = ¥(x) so that 
y = ft ely(x)]) and if there exist the derivatives f4, yz and үү then 


Ух = Уийх- 
Invariance of the form of the differential. If y = f(u) is a differentiable 
function of an independent variable 4 then 
dy = f' (u)du, (*) 


where du = Au. 

Let и be a differentiable function и = e(u) of a variable x. Then we 
may consider y as a composite function y = Д[(хХ)] of the variable x. Since 
x is an independent variable we can express the differential dy of the com- 
posite function y = f[e(x)] as 


dy = Лео) dx. 
The chain rule yields 
(Лео: = 7 097 69 
and we can write 
dy = f' (u)e' (х)ах. 
Since e'(x)dx = du we again get the identity (*) 
dy = f' (u)du. 


21* 
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Therefore the differential of a function is expressed by the same formula 
irrespective of whether the argument of this function is an independent 
variable or a function of another variable. This property is sometimes called 
the invariance of the form of the differential. It is worth mentioning that 
in the formula dy = f' (u)du the differential du is equal to an arbitrary 
increment Au of an independent variable и or, when и = ф(х), du = 
vy’ (x)dx is a linear part of the increment of и = ф(х) and, in general, is 
not equal to Au. 


Differentiation of inverse functions. Let y — f(x) be defined on a closed 
interval [а, b]. Suppose that the range of у = f(x) is the closed interval 
[c, 8] on the y-axis. Furthermore, let each у in [a, 8] correspond to 
the only x іп [a, 5] so that f(x) = y (Fig. 8.9). Then we can specify the 
function x = ф(у) on [a, В] by associating every y in [a, 8] with x in 
[a, b] such that f(x) = у. The function x = (y) is called the inverse func- 
tion of y = f(x). 

















Fig. 8.9 


Clearly, if x = ф(у) is the inverse of y = f(x) then у = f(x) is the inverse 
of x = e(y). In this case we speak of y = f(x) and x = ф(х) as of mutually 
inverse functions. We can write 


Лео) = у and pV] = x 


provided that y = f(x) and x = e(y) are mutually inverse functions. 
We shall follow a more constructive approach to specification of an 
inverse function. Let y — f(x) be an equation solvable for x so that every 
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y is associated with exactly one x. Then we have the equation x = (y) 
which defines x as a function of y. The function x = (y) is the inverse 
function of y = f(x). 

Examples. (1) If y = 3x is defined on [0, 1] then the function x = y/3 
defined on [0, 3] is the inverse function of y. 

(2) The inverse function of у= x, —-e «x« +œ, is x= УУ, 
—о<у< «оо, 

(3) Let 

_ (x for rational x, 
шан К — x for irrational x. 


Then the inverse function of y(x) is 


y for rational y, 
x- хе 
1— у for irrational y. 





ч) 





Fig. 8.10 


Clearly, the equations у = f(x) and x = (y) specify the same curve on 
the xy-plane. If we display independent variables on the x-axis in both cases, 
i.e., if we plot the functions y = f(x) and y = ф(х) instead of y = f(x) and 
x = ф(у), their graphs will be symmetric relative to the line bisecting the 
first and third quadrants of the coordinate plane (Fig. 8.10). 

We say that y — f(x) is an increasing function on [a, b] if given any 
xı and ж in [а, b] such that xı < хә there holds f(x1) < f(x). 

For instance, f(x) = x? is an increasing function for — oo < х < + оо. 
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Theorem 8.4. Let y — f(x) be a continuous and increasing function on 
[a, b] and let f(a) = a and f(b) = 8. Then y = f(x) has an inverse function 
x = (Уу) оп (ав, 8], x = (у) being continuous and increasing on [a, 6]. 

We confine ourselves to a geometric interpretation (Fig. 8.11) of Theo- 
rem 8.4. The curve AB represents the graph of y = f(x) which is continuous 
and increasing on [a, b]. Every y in [o, 6] corresponds to the only x in 
[a, b] so that f(x) = y. Thus AB sets a one-to-one correspondence between 
x and y and we may consider x as a function of y on [o, 8], i.e., we may 
say that х = (y) is the inverse function of y = f(x). The function x = e(y) 
represented by the curve AB is continuous and increasing on [a, 6] since 
as y increases, so does x. 

Similar consideration is fully applicable to any continuous function 
which is decreasing on [a, 5]. 

















Fig. 8.11 





Theorem 8.5. Let y = f(x) have a derivative f' (xo) # 0 at a point xo 
and let there exist an inverse function x = (y) of y = f(x), x = e(y) being 
continuous at yo = f(xo). Then the inverse function x = e(y) has a deriva- 
tive at the point yo and 


; _ 1 à 
¢' (vo) = fo9 C (xo) = 0). 


ча Consider x = (y) and an increment Ay in y for y = yo. Then we can 
write the corresponding increment Ax in x = (y) as 


Ax = po + Ay) — ФОУ). 
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Notice that Ay = f(xo + Ax) — f(xo). Then Ay » 0 implies that Ax ж 0; 
otherwise the given point xo would correspond to two distinct values 
Yo = f(xo) and yo + Ay which is impossible by the definition of a function. 
Against this background we can write the ratio Ax/Ay (Ay z 0) as 


Ax me. En IM 
Ay  Ay/Ax' 


If Ay tends to zero so does Ax, for x = (у) is continuous at уо. 


(*) 


Since y = f(x) has a derivative at xo, lim ау = f’ (хо). We also have 
Ах-0 


J’ (xo) # 0. Hence there exists a limit of the quotient 





1 
As ATE 0 (and, 





consequently, as Ax > 0) and this limit is equal 05 | y Then (*) implies 
Xo 


that there exists a limit of Ах as Ду > 0 and 
паа e. 
ay^0AÀy Ј' (х) 
On the other hand, the limit of 53 as Ay > 0 is the derivative y’ (yo) of 
= (y) at the point у = уо. Hence we have 


1 
I = —, > жж 
e'O)- у о у T (жж) 

It is easy to interpret Theorem 8.5 geometrically. If у = f(x) has 
a derivative at xo then there exists a tangent to a graph of y = f(x) at 
Mo(xo, f(xo)) and if this tangent is not parallel to the x-axis, then it is also 
a tangent to the curve x = (y) at Mo (Fig. 8.12). (Notice that the functions 
y = f(x) and x = (у) are inverse to each other and are specified by the 
same curve) Looking at Fig. 8.12 we see that /'(xo) = tana, e'(yo) = 
tan В and a + B = «/2 so that tan 8 = tan(z/2 — a) = cota = l/tan a, 
i.e., 


¢’ Oo) = 


f Оо) = 
We can also write (**) as 
Го) = PUT or xem (xj s 0). 


Differentiation formulas for inverse functions are easily obtainable by 
applying the chain rule. Indeed, let y = f(x) and x = e(y) be mutually in- 
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verse functions. Then e[/(x)] = x. On differentiating both sides with 
respect to x and using the chain rule, we get 


€, fi = 1 
Pens qoae: 


Фу 
Differentiation of inverse trigonometric functions. (a) The function 
y = sin”! xis defined on the closed interval [— 1, 1] (Fig. 8.13). Notice that 
y = sin^! x is the inverse of x = sin y defined on — т/2 < y < 7/2. 
On the open interval —1 « x « 1 x = sin y has the positive derivative 


ху = cos y for each y €(— 7/2, 1/2). Then there exists the derivative 
у= 11-22 1 2 1 
* x cosy wWl-simy М1-х 
where —1 < x < 1 and the “+” sign of V1 — x? is chosen, since cos y > 0 
for all y € (— 7/2, 2/2). 





у= өм! 

















* 
Fig. 8.13 - Fig. 8.14 
Hence 
Sin! x) = Fs, —1<х<1. (8) 


The points x = +1 are deleted, for the derivative ху = cos y is equal 
to zero at y = + п/2 where the right-hand side of (*) becomes undefined. 
(b) The function y = tan ! x, — о « x < + co, (Fig. 8.14), is the in- 
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verse function of x = tan y, — п/2 < y < 2/2. Then 








БЭХ ЭЕ ИЕМЕ eee 
у 1 1+їап у 1+х° 
cos? y 
Hence 
2 1 
tan ! x)’ = for all x. 
( ) 1 х? 


(c) To derive the formulas for the derivatives of cos ! x and cot ^! x 
it suffices to notice that 


sin ^! x + cos! x = 2/2, 
tan ! x + cot ! x = т/2. 





Whence 
(cos! x)’ = — a for -l<x<], 
1-Х 
cot ! x)! = - for all x. 
( ) 14x? 


Differentiation of hyperbolic functions. By definition 


x =E X а? 4 
е-е e+e 
-————— and cosh x = —— ——. 


sinh x = 5 5 


Then 


хо — , 
(sinh x)’ = (t) = cosh x 


(=) е sinh x. 


Also by definition 


and 


(cosh x)’ 


tanh x = sinh x and cothx = cosh 25, 
cosh x sinh x 





Using the quotient rule and the identity cosh? x — sinh? x = 1, we obtain 








inhx \’ _ cosh? x – sinh? x 1 
tanh x)’ = IBI) _ cosh’ x= sinh? x _ 1 — 
( ) ( cosh x cosh? x cosh? x 
and 
hx \’ _ sinh? x — cosh? x 1 
th ‘= cos жы ee eS ed x x 0 Я 
сон) (с х sinh? x sinh? x ( ) 
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Differentiation of basic elementary functions. Below is the list of for- 
mulas widely used in differential calculus. 











(x*)’ = өх 7! (a is any real number, x > 0); 
(loge x)’ = —L— (а> 0, а#1,х> б); 
(n3)! == (к> 0); 
(a)' = апа (а> 0, а # 1); 
(e*)’ Ser 
(sin x)’ = cos x; 
(cos x)’ = —sin x; 
(tan x)’ = 5 х1 + пт, п= 0, rl, +2, ... |}; 
cos^ x 2 
(cot x)’ = — 2 (Ann, n=0, +1, +2, ...); 
ѕіп? х 
(іп) x)’ = -- ; (=1<x< I); 
-x 
(cos ! x)’ = ^4 (-1«x« 1); 
-x 
- 1 
tan ! x)’ = : 
( ) 1 +x’ 
A 1 
сої! x)’ = - ——,; 
( х) 1 +x? 
(sinh x)' = cosh x; 
(cosh x)’ = sinh x; 
1 
tanh x)’ = ——,_; 
( ) cosh? x 
Р 1 
(coth x)' 2 — Sinh? x (x я 0); 


Logarithmic differentiation. Sometimes it is much easier to find a 
derivative of a natural logarithm of a function than that of this function 
itself. This gives rise to a convenient differentiating procedure called the 
logarithmic differentiation. 
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Suppose we are seeking for the derivative of a positive function y = f(x) 
and assume that it is much easier to compute the derivative of (x) = 
In y = In f(x). Then differentiating this function with respect to x, we 
obtain 


, 


= ф'(х). 


|< 


Whence y' = y:e'(x) or 
y’ = f(x) (In Д(х))'. 


This procedure is especially helpful when we deal with composite ex- 
ponential functions (power-exponential functions) of the form 


y = uQ 9, 
where u(x) and v(x) are both differentiable and u(x) > 0. 
Then, by logarithmic differentiation we have 
In y = v(x) In u(x) 
and 
u'(x) 


u(x) ` 





x = v’ (x) In u(x) + у(х) 


Whence 





»'G9 = [wenn (oe In u(x) + v(x) a). 


u(x) 


Example. Find the derivative of y = x^, x > 0. 
ча Taking the natural logarithm of y = х", we have 


Iny =x In x. 


Differentiating with respect to x gives 
Y х+1. 
y 
Whence 
y'= у(пх + 1) 
or 
y —-x(Inx-4l. > 
Applying differentials to approximate computations. Let y — f(x) be 
differentiable at a point x so that an increment in y admits a representation 
of the form 
Ay = f' (x) Ax + a(AX) Ах (Ax 0), 
where f' (х) Ax = dy and a(Ax) > 0 as Ax > 0. 
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Let dy ғ 0 and, consequently, f’ (x) z 0. Then 


Ay _, , a(Ay) 
dy 1 * fo 





Whence lim Z = ], ie., the infinitesimals Ay and dy are equivalent as 
4x70 


Ах 0. This means that the difference Ay — dy = a(Ax) Ax is an in- 
finitesimal of higher order than Ay and dy as Ax > 0. Hence we may use 
dy as an approximation of Ay, i.e., Ay = dy, and the relative error can be 
made whatever small for any sufficiently small |Ax!. Then we can apply 
the relation 


I(x + Ax) = f(x) + f' (х)Ах 
to approximate values of f(x). 
Example. Let y — x? (8 is any real number). Then 
= (x + Ax? — хб, 
dy = (ix? ^ ! Ax. 
Given small lAxl, we have 
(х + Ax? = xf + dy 
Or 
(x + Ax? = x? + Bx?^ ! Ax. 
In particular, for 8 = 1/2 
Vx + Ax = Vx + — АХ (x # 0). 
2Үх 


For instance, for V3.9978 = V4 + (— 0.0022). Putting x = 4 and Ax = 
—0.0022, we get 











43.9978 = V4 + (—0.0022) = V4 + С 0.0022) = 1.99945. 


8.4 Derivatives and Differentials of Higher Orders 


Derivatives of higher orders. Let /(x) have a derivative at every point 
x in an open interval (a, b). Then the derivative f“ (x) of f(x) is a function 
defined on (a, b). It may happen that / (х) has a derivative at a point 
x in (a, b). This derivative of f' (x) is called the second derivative of f(x) 
or the derivative of the second order of f(x) and is denoted by f” (x) or 
by f? (x). Thus 


=C. 
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Similarly the nth-order derivative of f(x) is the derivative of the 
(n — 1)th-order derivative of f(x), 1.е., 


f?) = g* Po). 


To compute the nth derivative / (x) one has to find the first derivative 
ТОО) of f(x), then the second derivative f” (x) as the first derivative of 
J’ (х) and continue this process until the nth derivative is computed. There- 
fore the rules and formulas of computing the first derivatives are sufficient 
to obtain derivatives of any desired order. 

Examples. (1) Compute the nth derivative of у = е“ where К = const. 
~ We get in succession у’ = ke, у” = K?e**, y" = ке‘, ... . By in- 
duction we can easily prove that 


(e^)? = k^e** (n = 1, 2, 3, = ). » 


(2) Compute the nth derivative of y — sin x. 


. т " 5 5 c 
4 We have y’ = cos х = sin (5-2) y" = -sin x = sin(x + т) = 


: T 
sin (+23 m 


By induction it follows that 


(sin x)? = sin (х + n3) for all ne N. » 


(3) Compute the nth derivative of y = cos x. 
ча Following the usual approach we have 


2 


The set of all functions f(x) which are defined on (a, b) and have con- 
tinuous nth derivatives at every point х € (a, b) is denoted by C"(a, b). 
We say that a function f(x) is infinitely differentiable on (a, b) and write 
Јо) € C"(a, b) if f(x) has derivatives of any order at each point хє 
(a, b). For example, the functions e", sin x and cos x are infinitely differen- 
tiable on (— œ, + о). > 

(4) Compute all the derivatives of the function у = х“. 
ча We have y = 4x3, yO = 12x?, yO = 24x, y? = 24. Since у® is a 
constant all derivatives of higher orders are zero, ie, y = yO = ... = 
y? = 0. > 

To give a physical interpretation of the second derivative we consider 
the law s = s(f) of a rectilinear motion of a point. Then the first derivative 
S’ (À = v(r) specifies the velocity of the point at the moment / and the 
second derivative s" (f) = v’ (£) is the acceleration of the point at f. 


(cos x) = cos (« +n 3) for all 1 € N. 
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Leibniz formula. Let u(x) and v(x) have the nth derivatives. Then 
(а) if у(х) = u(x) + v(x) then y? (x) = u? (x) + v? (x; 
(b) if у(х) = u(x)v(x) then 

у= и'о+ир’, 

y” =u"v + 2u'v' + uv", 

y" —u"v + 3u"v' + 3u'v" + uv" and so on. 


It is easy to notice that the formulas on the right resemble the binomial 
formulas for (и + v), (и + v)? and (и + v)? with the exponents indicating 
how many times и and v are to be differentiated. The resemblance becomes 
nearly complete if и and v are replaced by “ and v O9), standing for deriva- 
tives of the zero order. Using the method of mathematical induction, we 
can show that in general 


(uv) = n9. y + Ю y DUO + mn 1) 0-20 


" n(n- 1)... (п—-К+1) це. a 


(л) 
ki . + uur’. 


= share 





This relation is called the Leibniz formula. 

Example. Using the Leibniz formula, compute the derivative y" of 
the function y = x?e*. 
ча We have 


xya00Dy2 + 1001 (e?) 29. (x?) 


Z aiye ы 
y= -x = (е 17 


Ys 1001 x 1000 


J (е*)®°9Э(х?у” + 0 = e*x? + 2002e*x + 1001 х 10%е*. > 


We shall derive another useful formula. Let x = p(y) and у = f(x) be 
mutually inverse functions and let f'(x) = 0. Then © 


and 








d d (1 d {1\ dx yy 1 yy 
X " а (х ЧАБР = = — — = - ———, 
У = ау = б) dx (5; dy Од? у; Од? 


Differentials of higher orders. Let у — f(x) be differentiable at a point 
x. It may happen that at x the differential dy — f' (x) dx is a differentiable 
function of x. Then there exists a differential of a differential of a given 
function which is called the differential of the second order or the second 
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differential of y = f(x) and is denoted by 42у so that 
d’y = d(dy). 
Analogously we can define differentials of higher orders. The differential 


of the nth order "у от the nth differential of a function y = f(x) is the 
differential of the (n — 1)th differential of y = f(x) so that 


d"y = d(d" ^! y). 
The differential dy may be called the first differential ox the differential 
of the first order of a function y = f(x). 
Now we shall derive some important formulas for differentials of higher 
orders. Let y — f(x) be a function of an independent variable x and let 
y — f(x) have differentials of any order. Then 


dy = f' (x) dx, 


where dx = Ax is independent of x. 
By definition 


Фу = d(dy) = d(f' (x) ах), 


where f’ (x)dx is a function of x, so that the factor dx being independent 
of x can be taken outside the differential sign. So 


Фу = d(f' (хууах. 
Computing d(f'(x)) as the first differential of f'(x), we obtain 
d(f' (0) = (77:00) ах = f" (x) dx. 


Hence the second-order differential d?y of the function y = f(x) at the 
point x is given by the formula 


d'y = f" дах, 
where dx? stands for (dx)?, dx being the differential of the independent 


variable x at the point x. 
Mathematical induction gives the formula of the mth differential as 


d'y = f(x)dx", 


where dx” = (dx)". 
Whence 


My) = d 
f 0) = т 


Now let y = f(u) where u = ф(х) is a function differentiable sufficiently 
many times. Since the first differential retains its form, we have 


dy = f' (u)du, 
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where du = e'(x)dx is in general dependent on x. Then 


ау = d(dy) = d(/' (u)du) = d(f’ (u)) du + f’ (u)d(du) 
= f" (udu? + f' (u)a?u. 
If и is an independent variable then а?и = 0 and 
Фу = f" (u)du’. 

Comparing both expressions for d^y, we conclude that the second 
differential does not possess the property of invariance of the form. 
However, given a linear function и = ф(х) = ax + b where а and b are 
constant, the second differential retains its form. 

Differentiation of functions given in a parametric form. Suppose that 
the Cartesian coordinate xy-system is set up on a plane. Let (^ and (1) 
be continuous functions on a closed interval a < £ < 6. If we regard the 


parameter ¢ as time the functions ¢(¢) and (Г) specify the law of motion 
of a point M in the xy-plane so that the coordinates of M are given by 


р = (f) 
у= ү(0 

The set M of all points whose coordinates (х, у) on the xy-plane are 
given by the equations (*) is called the plane curve. In this case we say 
that a plane curve is given in a parametric form or is represented 


parametrically. 
For example, we can parametrically represent a circle with radius R and 


centre at the origin of coordinates by the equations 


x= R cost 
y=Rsint 


«<{<8. (8) 


Ozr«2m, 


where ¢ is the angle between the x-axis and the position-vector OM from 
the origin O to the point M(x, y) measured in radians (Fig. 8.15). 

The parametric equations of a plane curve can be reduced to the equa- 
tion F(x, y) = 0 by eliminating the parameter t. For example, if we square 
and add the parametric equations of the circle the parameter / is eliminated 
so that the circle with radius R and centre at the origin of coordinates is 
given by the equation x? + у? = R?, familiar to the reader. However, some- 
times we fail to eliminate ¢ from parametric equations. In these cases we 
need techniques and procedures that enable us to compute the derivative 
of y with respect to x when the curve is given in a parametric form. 

We say that a functional relationship between y and x is represented 
parametrically if both variables x and y are specified separately as functions 
of a parameter / so that x = e(t), у = (0, {є (а, 8). 

Let x = e(t) and y = (0) be defined and continuous for the values of 
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t in the open interval (a, 8). Suppose that there exists an inverse function 
t = р(х) of x = e(t). Then y = V[g(x)] is a composite function of x. Fur- 
thermore, we assume that ¢(f) and ү(/) are differentiable at the point £ € 
(a, В), e' (t) = 0 and t = g(x) is differentiable at a point x corresponding 
to £. Then by Theorem 8.3 the function у = V[g(x)] is differentiable at the 
point x and 





yx = Vite. 
On the other hand, by virtue of Theorem 8.5 we have 
М = 28 
Xi 
Fig. 8.15 
so that 
, D 1 2 
yee Уулыг 2t 
Xp Xt 


or 


dy_ vO (су, 
m on (e'() = 0). 


This result immediately follows when we divide the nominator and 





denominator of 2 by d! so that 


dy ауа _ 4’ (Ù) 


ах dx/dt e'(. 


22—9505 
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Example. Consider the circle represented parametrically by 





x= Rcost 
< Я 

о, на 
Тһеп 

dy _ dy/dt R cost 
E - Е шоог 

dx dxdt ^ —Rsmr: оо 
Or 2 - =- > (interpret this result geometrically). » 


If y(t) and (f) have the kth derivatives and y’ (t) z 0 then the com- 
posite function y = ¥[g(x)] has the Ath derivative with respect to x. 
The second derivative of y with respect to x is given by 


Фу d (2) _d (£9) dt 
dx? ах ах dt No' (0) ах 


-VOe (0-4 (0е"О0), 1 _ v"Qe'() v Oe" O 
(eY (N e' (t) (970) 


Of 
х 
and in general 
Б Э; 
ao? 
where y = f(x) is given in a parametric form by the equations x = x(f) and 
y = y(t). 


2 
Example. Compute = provided that y = f(x) is given by 
x 








Hence 





yi 


у = 


Ё = a(t — sin t), 


y = a(l — cos f). 











-« We have 
dy _ dy/dt_ asint _ tE 
dx  dx/dt a(l — cos t) 2 
and 
Фу 1 (2) = а гүй zal t 1 
dc dx (2 а V 2) ах dt X^ 2) аха 
TP REUS 1 "— ы T 


‚21 at 
sin 7 4a sin 3 
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8.5 Mean Value Theorems 


Theorem 8.6 (Rolle’s theorem). 7f a function f(x) (1) is continuous 
on [a, 5], (2) has a finite derivative at every point of (a, b) and (3) assumes 
equal values at the endpoints of [a, b] so that f(a) = f(b), then there exists 
at least one point Ё € (a, b) such that / (9) = 0. 
ча Since f(x) is continuous on [a, b] Theorem 7.30 implies that f(x) attains 
its maximum M and minimum m on [a, 5]. 

Two cases should be distinguished: 

(а) Let M = m. Then M < f(x) < M, ie, f(x) is a constant function 
on [a, b]. Hence f'(x) = 0 for all x in (a, Р) and the theorem is true. 

(b) Let M x m. Then f(x) attains either a maximum M or a minimum 
m at a point £ contained in (a, b) since f(a) = f(b) so that f(x) cannot 
attain its maximum M at one end of [a, b] and its minimum т at the 
other end simultaneously. For definiteness we set M = f(t), a«£«b 
(Fig. 8.16). 








Fig. 8.16 


Since f(x) has a derivative f’ (х) at every point of (a, b) there exists 
a derivative f'(£) at £ and 


a SE + Ax) — Л) р SUE - Ax) - Л) _ 6 
хэ? Ax Ee —Ax J. 
Ax»0 Ах»0 


On the other hand, f(£) = М is the maximum of f(x) on (0, b] so that 
S(E + Ax) -f(£) <0 and f- Ax) — Л) < 0. 


Whence 
f + A9 ЛӘ со ang f А0 -SO 59 (Ax > 0) 
Ах с Ах ^ | 


22* 
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Evaluating the limits as Ax > 0, we obtain the inequalities 
Sf’) <0 and /(20 


which must be simultaneously true. Hence, f' (£) = 0 and the theorem is 
proved. » 

The geometric interpretation of Rolle’s theorem is that if y = f(x) satis- 
fies all the three conditions of the theorem then the graph of f(x) is 
represented by the curve AB (Fig. 8.17) such that (1) the curve AB is con- 
tinuous on [a, 5], (2) there exists a tangent to the curve at any point between 
A(a, f(a)) and B(b, f(b)) and (3) the ordinates of the points A and B of 
this curve are equal. Rolle's theorem states that at least for one point C 
(Е, f(E)) of the curve AB there is a tangent parallel to the x-axis. 





Fig. 8.17 


By way of illustration we shall show that the conditions (1)-(3) are im- 
portant and if they are violated Rolle's theorem may not be true. Consider 
the function f(x) = Ixl, —1 <x < 1 (Fig. 8.18). For this function condi- 
tion (2) is not satisfied as f(x) has no derivative at x = 0. In this case Rolle's 
theorem is inapplicable as in (—1, 1) there exists no point at which the 
derivative f' (х) is equal to zero. Indeed, f'(x) = —1 if —1« x < 0 and 
Го) = 110 <х<1 while at x = 0 f'(x) does not exist. Looking at 
Fig. 8.19 we notice that the function f(x) 2 x — [x] does not satisfy condi- 
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tion (1) of Rolle's theorem since the point x — 1 is a discontinuity of f(x) 
and thus f(x) is not continuous on (0, 1]; the derivative f’ (x) is equal to 
1 at all points in (0, 1). 

Problems. (1) Consider the function f(x) = 1 + x"(1 — x)" where m 
and n are positive integers. Without computing the derivative of f(x), show 
that the equation f'(x) = 0 has at least one real root in (0, 1). 

(2) Prove that the equation x? + 3x — 6 = 0 has only one real root. 

Theorem 8.7 (mean value theorem)". 7f a function f(x) (1) is continuous 
on [a, b] and (2) has a derivative f' (x) in (a, b) then there exists at least 
one point £ in (a, b) such that 


ЛӘ) p, а<Е<ь 
ч We introduce an auxiliary function F(x) on [a, b] as 
Бод = Јо) - Ла) -IO (y а) 


The function F(x) satisfies the conditions of Rolle’s theorem. Indeed, 
F(x) is continuous on [a, b} since every summand in F(x) is continuous 
on [a, b]; in (a, Б) F(x) has a finite derivative, for every summand of F(x) 
has a derivative in (a, Р); and F(a) = F(b) = 0, ie, F(x) assumes equal 
values at the endpoints of [a, b]. 

Then by virtue of Rolle's theorem there exists at least one point £ in 
(a, b) such that F'( = 0 

On the other hand 


FQ) = f'o - £0. f) 


so that at the point £ 
г @-#®-=/® . 
Whence 
= ЛӘ, tela, p. > 


Rolle’s theorem is a specific case of the mean value theorem for deriva- 
tives and is easily obtained by putting f(a) = f(b) in the latter. 


? In Russian-language mathematical literature this theorem is called the Lagrange 
theorem. 
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To interpret the mean value theorem geometrically we turn to Fig. 8.20. 


It is easy to notice that the ratio ДЭ ~ Fa) is the slope of the line AB 


and f’ (£) is the slope of the tangent to у = f(x) at the point with the abscis- 
sa x — £. Hence the mean value theorem states that if AB is a continuous 
curve such that there is a tangent at every point between A and B then 
there exists at least one point C(£, f(£)) between A and B at which the 
tangent to the curve AB is parallel to the straight line AB. 

The formula 


£0) - fi) .. p (o 











ДЬ) - f(a) = (906 – а), а<Е<Ь 
is also valid for a > b. Since £ is in general unknown ії is convenient to 
represent it as 
&=a+ 0(b – а), 
where @ is a real number, 0 < 0 « 1. Then the above formula becomes 
f(b) — f(a) = f'(a + 0(b – ab – а), 0«8«1. 
Replacing a and b by x and x + Ax, respectively, we get 
Af(x) = f(x + Ax) - fœ) = f (x + 96Ax)Ax, 0«60«1. 


This is the exact expression that relates an increment of y = f(x) and an 
increment Ax whereas the relative error of the approximate relation 


Af(x) = f' О9Ах 
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tends to zero only as Ax > 0. Also notice that in the exact relation the 
number @ is in general unknown. 
Example. By use of the mean value theorem prove that 


Пап” !x—tan"! xi! &lx —xil vx, Хо. 


~ Consider the function f(x) = tan ^! x. This function satisfies the condi- 
tions of the mean value theorem on any [a, b]. Then for any x; and x: 


So) — Ло) = f' (C2 — x) 





or 
tan ! x; — tan ! x, = т Qo — x1), 
1+# 
where the point £ is between the points x; and x2. 
Whence 
ап”! x» — tan"! xil = шин Ix. — xil 
l+ 
and 
ап” x» — tan“! xil € lx; — xil 
since 1 z <1 for all £. » 
I+ 


Problem. Using the mean value theorem prove that 


X man 


Tax <n +) <x, x> -1. 


Theorem 8.8 (Cauchy mean value theorem). 7f the functions f(x) and 
ф(х) (1) are continuous on [a, Б], (2) have derivatives f'(x) and e' (x) in 
(a, b) and (3) if e' (x) = 0 in (a, Б) then there exists at least one point 
E in (a, b) such that 


f) - fa) _/'@ А 
eb а) (0° “<*<? 9 


-* From the conditions of the theorem it follows that the difference 
Ф(Б) — e(a) can not be equal to zero. Indeed, if y(b) — (a) = 0 then g(x) 
would satisfy the conditions of Rolles theorem so that р’ (x) would be 
equal to zero at least at one point £ in (a, b). However this contradicts 
condition (3) of this theorem. Hence the formula (*) makes sense. Let us 
Show that this formula holds for some £ in (a, b). 

Consider the auxiliary function 





FQ) = fœ - fla) - ЈО Д0) (еб) — ew) 
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that satisfies the conditions of Rolle's theorem. Indeed, (1) F(x) is continu- 
ous on [a, b] since f(x) and ф(х) are continuous on [a, b]; (2) Р(х) has 
a derivative F” (x) at every point in (a, b), since every summand in the right- 
hand side of F(x) has a derivative in (a, b); and (3) F(a) = F(b) = 0. 

From Rolle’s theorem we conclude that between a and b there exists 
Ё such that F'(£) = 0. The derivative F'(x) of F(x) is 


, — f’ = fb) - f(a) , 
F'(x) = 7 (x) фу ушу” (x) 
so that 


УОНА 
Ө E #9 = 9. 


Dividing both sides by ф'(&) # 0, we arrive at the desired formula 


Ө sO- „ 
e'()  e(b)- (а) 

It is easy to notice that if we put ф(х) = x the Cauchy mean value the- 
orem becomes the mean value theorem for derivatives. 

Problem. Given the differences f(b) — f(a) and (b) — e(a), could we 
derive the formula involved in the Cauchy mean value theorem by applying 
the mean value theorem to these differences? 

Remark. Rolle's theorem, the mean value theorem and the Cauchy mean 
value theorem imply that there exists some “middle” point £ € (a, b) at 
which some of the named relations are true. For this reason all these the- 
orems are collectively named the mean value theorems for derivatives. 





8.6 LHospital’s Rule 
Let functions f(x) and ф(х) be defined in a neighbourhood of a 


point х = a and let f(a) = 0 and (а) = 0. Then the quotient LO лэ ) becomes 


indeterminate at x = a while the limit of this quotient can exist at x = а. 


The notation 0 is frequently used to refer to this ambiguous situation. 


: ла го 20022 
When we wish to compute lim PTS where the quotient 20) is indeter- 


minate at x — a we shall speak of evaluating the indeterminate form E ) 


Evaluating the indeterminate form E involves the computation of 


m tim 2 provided that lim 1700) = œ and lim р(х E 
x>a Ф 
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Evaluating the indeterminate form оо — oo involves the computation of 
lim[f/(x) — ¢(x)] provided that lim/(x) = æ and lim e(x) = oc. 
x-^a rca 


xa 
We can introduce the similar notions for limits at infinity, 1.е., for limits 
as x > oo. 
Theorem 8.9 (UHospital's rule). Let f(x) and ф(х) have derivatives Л! (х) 
and e' (х) in a neighbourhood (a — 5, a + 5) of a point a except probably 
at a. Suppose that ф(х) and ф (x) are not equal to zero in (a — 6, a + ò). 


If limf(x) = 0 and lim g(x) = 0 and if the quotient f e) has a finite or 
xa xa 


e' 09 





infinite limit as x ^ a then there exists 
lim LO). = lim LOO, 
хэа (x) х-а e'(x) 


ча Theorem 8.9 says nothing about the values of f(x) and g(x) at x = a. 
We put Ла) = 0 and (а) = 0. Then lim/(x) = f(a) and ітое(х) = e(a) 


and the functions f(x) and ф(х) become continuous at a so that on 
[а, x] (or on (х, a]) where x is a point in (a — 6, a + $), f(x) and e(x) 
satisfy the conditions of the Cauchy mean value theorem. Hence, there ex- 
ists at least one point £ = (x) between a and x such that 


fe) _ Ло) Ла) f е) 
ф(х) ew- e(aà) 6'(8) 
If for some x there exist more than one such £ we choose апу of them. 
The point £ is dependent of x and £ — a as x > a. As stated, the quo- 
VH 


r 


tient 





m has a finite or infinite limit as x — a. This limit is independent 
X 


Ff 


, 


of how x tends to a. So the quotient 


of LO 
g(x) 





" has a limit equal to a limit 


as x —^ a and, consequently, as £ — a, so that 





lim © = tim) : (жж) 
pe (£ х-аф'(х) 


From the identities («) апа (жє) we have obtained it follows that 


ra ф(х) = xa p(X) 
The above identity represents L'Hospital's rule which allows, under 
some specific conditions, to replace a limit of a quotient of functions by 
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a limit of a quotient of their derivatives which is sometimes easier to 
compute. 
Example. 
lim — COS х _ lim! — cos x)’ _ lim 22 х 1 


x70 x? x20 (х) iex 2 





Remarks. (1) If the conditions of this theorem are satisfied on the open 
interval (a — 6, a) or on (a, а + 6) L-Hospital's rule is applicable to compu- 
е £0 

e(x) 

(2) It may happen than the limit of a quotient of derivatives does not 

exist while the limit of the quotient of the respective functions does exist. 


tation of the limit o as x —^ a — 0 or as x ^? a + 0, respectively. 


As an illustration we consider the functions f(x) = x? sin + and ф(х) = x. 
For the quotient of these functions at x = 0 we have 


1 





2 + 
x^ sin — 
m JO) = lim X . limx sin = 0. 
m g(x) x70 x x20 x 
On the other hand, the quotient of the derivatives 2 = = 2x sin 1- 
e'(x 

cos — 1 nas no limit at x = 0. Hence, from the existence of lim feo) it does 

x x20 ф(х) 
not necessarily follow that lim Го) exists. 

x20 e'(x 


(3) Sometimes we have to apply L'Hospital's rule repeatedly when com- 








puting lim fe S 2. For example, if f(x) and e(x) and their derivatives f’ (x) 
х-аф 

and e' (x) ай satisfy the hypothesis of Г.НояриаГ5 rule we can apply the 

rule to compute lime w and so on. 

хэаф” (x) 
Example. 

lig — Sin Х = jm & — sin x)* = lim =- cosx _ р sinx _ l. 
x20 x? x70 o)’ x20 3x ? x20 бХ 6 


Theorem 8.10. | L'Hospital's rule for an indeterminate form 2 . Let 


the functions f(x) and ф(х) have the derivatives f' (x) and e' (x) in a neigh- 
bourhood (a — 5, a + 8) of a point a except probably at a. Suppose that 
€ (x) and e' (х) are not equal to zero in (a — 6, a + $). If limf(x) = œ and 
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lim e(x) = œ and if L T has a finite or infinite limit as x > a then there 
xa e (x 


exists 


lim 9. = lim © | 
кэаф(Х) х-аф' (х) 
Here we can also consider one-sided limits as x ^ a – Оогаѕх »+а + 0 


(see Remark 1). 





Example. 
a COS ax 
Insinax _ |. (In sin ах)” |, sin ax 
x50«0 In sin bx х-о+о (In sin bx)’ х-0-0 b cos bx 
sin bx 





-9 jg (S082. sin x) ү (50 520, 
b x>0+0 \cos bx sin ах 


L'Hospital's rules are used to compute the following limits: 
(a) lim[fG)e(x)] for limf) 20 and lime(x)- e. 
ra ха xa 


-4 It suffices to represent f(x)e(x) as 


_ ЈО) _ еб) 
700600 = 17609) or fæ = 17709” 


Then the functions on the right satisfy the hypothesis of LHospital’s 
гше. » 











Example. 
lim (xInx) = lim P*= lim -= 
х-0-0 x70+0 I/x х-0+0 —l/x 


(b) lim [fC — 200| for lim/(x) = o» and lim e(x) = о. 
ча To apply L'Hospital's rule it suffices to write f(x) — ф(х) as 


_ 1 | 1 меб) = 1469 
IO) — e0 = туа) Тубу — П) ео) O 





Example. 
: х 1 р ХШХ-Х-41 
n E ых) = im (х= 0х 
; In x : “1/х 1 
= lim —— — ———— = lim Lc. 
rilnx+1—Ix ruülxedMxX 2 


(с) lim[f/(9]** for each of the following cases 
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(i) limf) = 0 and lim e(x) = 0 (09; 
(ii) “lim f(x) = 1 and lim e(x) = eo (1%); 
(iii) im f(x) = + оо and. lim g(x) = 0 (о). 
ч Put iis d 
у = ol. 
Let us consider 
In y = ф(х) In f(x) 
and evaluate 
ин у- lim[eGo)In ТОО. 
It is easy to notice that we have to evaluate a limit considered in (a) 


for each of the cases (i)-(iii). 
Suppose that limIn y = A. Then іту = e^, ie., 
xa 


xa 


іт [00] = e^. » 


х- а 


Example. Find lim x. 


x70+0 


ч Put y = х". Then In y = x In x. Whence 





lim Iny= lim PX = jim !%, =0, 
x>0+0 x20«0 1/х х+0+0 —l/x 


so that 
lim y-e?-1. > 
х-0-0 

Theorem 8.11. Suppose that (1) functions f(x) and ф(х) are defined for 

all x such that |x| is sufficiently large; (2) lim f(x) = lim ф(х) = 0 or 
x>w x>w 
lim f(x) = © and lim ф(х) = оо; (3) there exist derivatives f'(x) and 
xce 


xe 


e’ (X) (e' Qo) ж 0) for all x such that |x| is sufficiently large; (4) there exists 


a finite or infinite limit of the quotient ло) as х —^ œ. Then 


e^Q) 





Јо) 


lim £0) = jim FO , 
re p(x) xray’ (x) 
To verify that Theorem 8.11 is true it suffices to put x = 1/¢ and use 
the results of Theorem 8.9 and Theorem 8.10. 
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Example. 
NES . 2x : 2 
lim == lim == lim —-0 
хә +оо е хә +w е хә +оо C 


Applying LHospital's rule repeatedly п times, we obtain 


a 


-1 
1 Ё пх" 
lim = lim ===... = lim " p 
хо +оо е X + е хэ +00 87 


| х 





E 


so that the function e* increases faster than any power function of x as 
x> + о, 
The following example shows that L'Hospital's гше being applicable in 


КУ БЕЛЕЕ) 
evaluating lim мах turns out to be unsuitable for practical pur- 


хә +® 


poses. Indeed, applying LHospital's rule, we have 


lm ———-2 lim m= lim —— 
хә + x xc 1 хә +0 V1 + х? 
of 2 
= lim 1 = lim Se etc 
xo X +o x 


while elementary algebraic manipulations easily yield 


li V1+ x? + x? Юн 
im - 
хә xo x nm. 


8.7 Tests for Increase and Decrease of a Function 
on a Closed Interval and at a Point 


Definitions. We say that a function f(x) defined on a closed interval 
la, Б) is nondecreasing on [a, b] if given any x; and х» in [a, Б], the condi- 
tion x; < x; implies that f(xi) < Оо). 

If хү < x? always implies that f(xi) < Лоо) then f(x) is said to be in- 
creasing on [a, b]. 

We say that a function f(x) is nonincreasing on [a, b] if given any xı 
and хэ in [a, b], xı < xz implies that f(x) 2 (x2) and f(x) is decreasing 
on [a, b] if x1 < x? implies that f(x1) > f(x). 

A function f(x) is said to be monotone on [a, b] if f(x) is only non- 
decreasing, in particular increasing, on [a, b] or only nonincreasing, in par- 
ticular decreasing, on [a, 5]. 
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Theorem 8.12. Let f(x) be continuous оп (а, Б] and let f(x) have a 
derivative f' (x) on (a, b). For the function f(x) to be nondecreasing on 
[a, Б] it is necessary and sufficient that f' (x) > 0 for all x in (a, Б). 

Necessity. Let f(x) be nondecreasing on [а, b} (Fig. 8.21). We prove that 
РО) 2 0 on (a, b). 


y-f(x) 





r-Ar r 
Fig. 8.21 


ча Consider two points x and x + Ax in (a, Б). Since f(x) is nondecreasing 
then Ax and f(x + Ax) — f(x) are of the same sign for any Ax so that 


f(x + AX) — fO) o 
Ax di 


Notice that at any point x in (a, Б) there exists a derivative f’ (x). Then 
from the above inequality it follows that 


/' Од) = Дк + A) — f0 5 9, 


lim 
Ax—0 
Hence at any x in (a, Б) /'(х) 20. > 
Sufficiency. Let f’ (x) 2 0 on (a, b). We prove that f(x) is nondecreasing 
on [a, 5]. 
“4 Indeed, let x; < x? be any two points in [a, b]. By Theorem 8.7 we have 


Јов) — f6a) = F (006 — xı), 


where ху < £ < x. 
Since f’ (x) > 0 at any point x in (a, Б) then f' (£) > 0. Also, x; > xı. 
This implies that 


Тоо) 2 Ја). 
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Hence, f(x) is nondecreasing on [a, b] since f(x) < f(x;)) for 
xi «x». > 

Similarly, we can prove the following. 

Theorem 8.13. Let f(x) be continuous on [a, b] and let f(x) have a 
derivative f'(x) on (a, b). For the function f(x) to be nonincreasing on 
|а, b] it is necessary and sufficient that f' (x) < 0 for all x in (a, b). 

Therefore if f'(x) does not change its sign on some interval, f(x) is 
monotone on this interval. Also true is the following proposition: if 
f (x) > 0 in (a, Б) then f(x) increases on [a, b]. This proposition gives the 
sufficient condition for a function to increase. 

It is worth mentioning that if f(x) increases on [a, 5] then it does not 
follow that f' (х) > 0 everywhere in (a, b). 

Example. The function f(x) = x? increases on [—1, 1]. However the 
derivative f’ (x) = 3x? is equal to zero at x = 0. 

We can also speak of a function decreasing or increasing at a point. 

A function f(x) is said to be increasing at a point x — xo if there exists 
a neighbourhood (xo — ô, xo 6) of хо such that f(x) < f(xo) whenever 
x « xo and f(x) > f(xo) whenever x > xo (Fig. 8.22). 





Fig. 8.22 


Analogously, a function f(x) is said to be decreasing at a point x — Xo 
if given some neighbourhood of xo, f(x) > f(xo) whenever x < xo and 
Лх) < f(xo) whenever x > хо. 

The following theorem specifies the sufficient conditions for a function 
to be increasing or decreasing at a point. 

Theorem 8.14. Let f(x) have a derivative f' (xo) at xo. If f’ (xo) > 0 then 
fœ) increases at хо and if f'(xo) < 0 then f(x) decreases at хо. 
ча Let f’ (xo) > 0. Then 


(xo + Ax) — f(xo) 
АХ > 0. 





lim 
Ах-0 
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This means that there exists 6 > 0 such that for all Ax 
Лоо + Ax) — f(xo) 
AT > 0 





whenever 0 < 1Ах| < ô. 

Whence it follows that if 0 < 1Ах| < 6, Ax and f(xo + Ax) -- f(xo) are 
of the same sign, namely, if Ax < 0 then /( + Ax) — f(xo) «0, ie, 
/0% + Ах) < f(xo, and if Ах» 0 then /(х + Ax) — До) > 0, ie, 
Jo + Ax) > f(x). 

By definition this means that f(x) increases at xo. 

Using similar reasoning, we can show that if f’ (xo) < 0 then f(x) 
decreases at xo. > 





Fig. 8.23 





Fig. 8.24 


Notice that the function shown in Fig. 8.23 increases at x — 0; however 
the derivative of this function does not exist at x = 0. 

The function f(x) = x? increases at x = 0 and its derivative f’ (x) = 3x? 
vanishes at x — O (Fig. 8.24). 


8.8 Extrema of a Function. Maximum and Minimum 
of a Function on a Closed Interval 


Local extrema. Let f(x) be defined in a neighbourhood of a point 
xo and at хо. We say that f(x) has a local maximum at хо if there exists 
6 > 0 such that 


Af = f(x) — f(xo) < 0 
for all x in (xo — à, xo + ô) (Fig. 8.25). 
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A function f(x) is said to have a local minimum at xo if there exists 
6 > 0 such that 


Af = f(x) — Лоо) 2 0 


for all x in (xo — 6, xo + 5) (Fig. 8.26). 

The point xo at which f(x) has a local maximum (minimum) is called 
the point of local maximum (minimum). The local maximum and local 
minimum of a function are called the local extrema of this function. 





Хо Хо?д 


Fig. 8.25 





Fig. 8.26 


These definitions mean that f(xo) is a local maximum of f(x) if there 
exists an open interval (xo — 5, xo + 5) such that /(хо) is a maximum of 
f(x) on this interval and f(x) is a local minimum of f(x) if there exists 
an open interval (xo — 6, xo + 5) such that f(xo) is a minimum of f(x) on 
this interval. 


23—9505 
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The term “local extremum” is used here because we consider a maxi- 
mum or minimum of a function within some neighbourhood of a given 
point rather than over the domain of a function. For example, the function 
y = f(x) shown in Fig. 8.27 has a local maximum at xo and a local mini- 
mum at x; but f(xo) < f(x). 

In what follows we shall speak of extrema of a function omitting “lo- 
cal" for brevity. 

A function f(x) is said to have a strict maximum (minimum) at a point 
Xo if there exists 6 > 0 such that 


ЈО) — f) «0 (Ло) – Лоо) > 0) 





Fig. 8.27 


whenever 0 < lx — xol < 6. Then the point хо is called the point of strict 
maximum (minimum) of a function. Here we do not assume that f(x) is 
continuous at xo. For example, the function 


x? for xz0 
70) = F for x=0 
is not continuous at x = 0 but f(x) has a maximum at x = 0. Indeed, there 
exists 6 > 0, say 6 = 1, such that f(x) — f(0) = f(x) — 1 < 0 for all x # 0 
in (—1, 1) (Fig. 8.28). : 
Problems. (1) Using the definitions of maximum and minimum, prove 
that the function 


e^" for x0 
ЈО) = { 0 fo x=0 
has a minimum at x — 0, and the function 


-1/x 
sue xe fo х# 0 
0 юг х= 0 
has no extremum at х = 0. 
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(2) Find out whether the function f(x) = (x — xo)" ф(х) has a maximum 
or minimum or has no extremum at xo provided that (1) (x) is continuous 
at xo and (хо) Æ 0, (2) the derivative Ф’ (x) does not exist at хо and (3) 
n is natural. 

Theorem 8.15 (necessary condition for extremum). A function f(x) can 

have an extremum only at the points where its derivative f' (x) either is 
equal to zero or does not exist. 
4 Let f(x) have a derivative at хо and let f' (xo) # 0. For definiteness we 
set f’ (xo) > 0. Then f(x) is an increasing function at Хо so that there exists 
ô > 0 such that f(x) < f(xo) for all x in (xo — ё, xo) and f(xo) < f(x) for 
all x in (xo, xo + 6) (Fig. 8.29). This implies that there exists no neighbour- 
hood of xo where /(Хо) is a maximum or a minimum of f(x) and the point 
хо is neither a point of maximum nor a point of minimum of f(x). 








Fig. 8.28 Fig. 8.29 


Using similar reasoning we arrive at the same conclusion for f’ (xo) < 0. 

Hence, if there exists a derivative f’ (x) at a point xo and if f’ (xo) 50 
then f(x) can have neither a maximum nor a minimum at xo. Thus f(x) 
can have an extremum only at a point where its derivative f’ (x) either is 
equal to zero or does not exist. В» 

Figure 8.30 offers a geometric illustration of Theorem 8.15. The func- 
tion y = f(x) has extrema at the points xi, x2, x3 and x4. The derivative 
f" (x) does not exist at xı and x, and f'(x) is equal to zero at x? and x3. 

The points where the necessary condition for extremum of a function 
f(x) is satisfied are sometimes called the critical points of f(x). These are 
the roots of the equation f'(x) = 0 and the points where /' (x) does not 
exist (in particular, the points where f” (x) is an infinity). The points where 
Sf’ (x) = 0 are called the stationary points of f(x), since the rate of change 
of the function f(x) at these points is zero. 

Theorem 8.15 specifies only the necessary condition for extremum of 
a function f(x) and it does not necessarily have a maximum or a minimum 


23* 


356 8. Differential Calculus. Functions of One Variable 


at every critical point. For example, if f(x) = x? then f' (0) = 0 so that the 
point x = O is a critical point. However f(x) = x? has no extremum at x = 0 
since /(0) = 0 and f(x) « 0 for x « 0 and f(x) > 0 for x > 0, which means 
that f(x) increases at x = 0. 

The following theorems specify the sufficient conditions for a function 
to have a maximum or a minimum at a point. 








Theorem 8.16. Le! x — xo be a critical point of a function f(x) where 
either f' (xo) = 0 or f' (xo) does not exist and let f(x) be continuous at xo. 
Suppose that there exists 6 > 0 such that f' (x) > 0 for all x in (xo — 6, 
xo) and f'(x) < 0 for all x in (xo, xo + ô) so that the derivative f'(x) 
changes its sign from positive to negative at xo when x moves through xo 
from left to right. Then f(x) has a maximum at xo. 
< Since f'(x) > 0 on (xo — 6, xo) then f(x) increases on the closed interval 
[xo — 8, xo]; similarly, f' (x) < 0 on (xo, xo + ô) implies that f(x) decreases 
on the closed interval [xo, xo + 5]. Hence /(xo) is a maximum of f(x) within 
the neighbourhood (xo — 6, xo + 5) of хо (Fig. 8.31). This means that f(xo) 
is a local maximum of f(x). 9» 

Theorem 8.17. Let x = xo be a critical point of a function f(x) where 
either f’ (xo) = 0 or f' (xo) does not exist and let f(x) be continuous at хо. 
Suppose that there exists 6 > 0 such that f' (x) « 0 for all x in (xo — 6, 
хо) and f'(x) > 0 for ай x in (xo, xo + 8) so that the derivative f' (x) 
changes its sign froin negative to positive at xo when x moves through Хо 
from left to right. Then f(x) has a minimum at xo. 

The proof of Theorem 8.17 is similar to that of Theorem 8.16. 

If within some neighbourhood (xo — 6, xo + 6) of a critical point xo 
the derivative f’ (х) is of the same sign on the left and on the right of хо 
the function f(x) has no extremum at xo. Thus, if f'(x) > 0 for all x in 
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(xo — 6, xo) and for all x in (xo, xo + 6), then given any whatever small 
(Xo — ô, xo + 8) f(x) increases both on the left and on the right of xo so 
that f(xo) is neither a maximum nor a minimum of f(x) in (xo — à, xo + ô), 
ie, f(x) has neither a maximum nor a minimum at xo. 

For the sufficient conditions given by Theorem 8.16 and Theorem 8.17 
to be satisfied it is important that the function f(x) be continuous at xo. 
For example, if 


-Х for х< 0, 
70) = С. 1 for x20, 
then the derivative f’ (x) does not exist at x = 0 (Fig. 8.32). The derivative 
J’ (x) changes its sign when x moves through x = 0; however, f(x) has no 
extremum at x = 0 since there exists no neighbourhood of x = 0 where 
f(0) 2 1 would be either a maximum or a minimum of f(x). The point 
is that the function f(x) is not continuous at x = 0. 


У 











0 г 


Fig. 8.31 Fig. 8.32 


The general procedure of computing extrema of a function involves the 
following steps: 

(a) Compute the derivative f' (x) and find the roots of the equation 
/'@) = 0; 

(b) Find all points where / (x) does not exist. These points along with 
the roots of f' (x) = 0 are critical points of f(x). 

(c) Determine the signs of f'(x) on the left and on the right of every 
critical point. Then f(x) has a maximum at a critical point xo if f'(x) 
changes its sign from positive to negative when x moves through this critical 
point from left to right and f(x) has a minimum at xo if f’ (x) changes 
its sign from negative to positive at xo when x moves through this point 
from left to right. If f' (x) does not change its sign when x moves through 
Xo f(x) has neither a maximum nor a minimum at хо. 
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Examples. (1) Investigate the function у = x?e * for extremum. 

4 (a) Computing the derivative у’ of y, we have у’ = 2хе " — x'e * = 
e *Q — xx. 

(b) Solving the equation y’ = O gives the critical points x = 0 and 
x = 2. 

(c) Determining the signs of the derivative, we conclude that: (i) y’ is 
negative on the left of x = 0 and positive on the right of x = 0; (ii) y’ 
is positive on the left of x = 2 and negative on the right of x = 2. Hence, 
f(x) has a maximum at x = 2 and a minimum at x = 0 (Fig. 8.33). » 

(2) Investigate the function y = x?/? for extremum. 


*4 (a) The derivative у’ .21 


3 ms 

(b) The derivative y’ does not vanish; however it does not exist at x = 0 
so that y'(x) ^ o» as x ^ 0. Hence the only critical point is x = 0. 

(с) The derivative y’ (x) is negative on the left of x = 0 and positive 
on the right of x = 0. Hence the function f(x) has a minimum at x = 0 
(Fig. 8.34). » 


у=? 











Fig. 8.33 Fig. 8.34 


(3) Investigate the function y = x? for extremum. 
“4 (a) The derivative у’ = 3x?. 

(b) The critical point as a solution of 3x? = 0 is x = 0. 

(c) The derivative y ' (x) is positive on the left and on the right of x = 0. 
Hence the function f(x) increases at х = 0 and has neither a maximum 
nor a minimum at x = 0. » 

Remark. If a function f(x) has a minimum at a point xo this does not 
imply that /(x) necessarily increases on the right of xo and decreases on 
the left of хо. Consider the function 


х? (2 — sin 1) for х0, 
Јо) = 
0 for x=0. 
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It is easy to see that at x = 0 f(x) is continuous and has a minimum 
(Fig. 8.35). The derivative f" (x) = 2x (2 — sin 1 $ cos T is continuous 
іп any neighbourhood of x = 0 except at x = 0 and f” (x) changes its sign 
infinitely many times. The function f(x) is not monotone on the left and 
on the right of x = 0. 

Applying the second derivative to investigation of a function for extre- 
mum. The following theorem specifies the sufficient conditions for a func- 
tion to have an extremum at a point. 





Fig. 8.35 


Theorem 8.18. Let f(x) have the first and second derivatives at a point 
xo and let f' (хо) = 0 and f" (хо) = 0. Then at xo the function f(x) has a 
maximum if f" (хо) < 0 and a minimum if f" (xo) > 0. 
ча Observe that the point хо is a critical point of the function f(x) since 
УТ (xo) = 0. Let f" (xo) < 0. This implies that at xo the first derivative f’ (x) 
of f(x) decreases so that there exists a neighbourhood (xo — ô, xo + ô) of 
хо such that f’ (х) > f’ (xo) = 0 for all x in (xo — $, xo) and f'(x) < 
f' (xo) = 0 for all x in (xo, xo + ô. Hence f'(x) changes its sign from posi- 
tive to negative at xo when x moves through xo from left to right. Thus 
f(x) has a maximum at хо. 

Similar reasoning yields that f(x) has a minimum at xo if f” (хо) > 0 
at хо. Р» 

Theorem 8.18 enables us to lay down the following useful procedure 
for investigating a function for extremum. First, we find all critical points 
of a function as outlined in the general procedure given above. Second, 
we compute the second derivative f” (x) at a critical point and determine 
its sign if f” (x) exists. If at some critical point хо f" (xo) < 0, f(x) has a 
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maximum at xo and if f” (xo) > 0 at xo then f(x) has a minimum at xo. 
If the second derivative either is equal to zero or does not exist at xo then 
we can decide on the extremum of f(x) at this point by using the first deriva- 
tive of f(x) as specified by the general procedure. 
Example. Investigate the function y = e~* for extremum. 

-« We have у’ = —2xe~* so that x = 0 is a critical point of Ло). We 
find y” = —2e^* + 4x2e-* and y"(0) = —2 < 0 so that f(x) has a 
maximum at x = 0 (Fig. 8.36). » 





ч 





Fig. 8.36 








Fig. 8.37 


The absolute maximum and minimum of a function continuous on a 
closed interval. If a function f(x) is defined and continuous on a closed 
interval [a, b] then, by virtue of Theorem 7.30, f(x) attains its absolute 
maximum and absolute minimum on [a, 5]. 

If f(x) attains its maximum M at an interior point xo of [а, b], i.e., 
а < Xo < b, then M = /(хо) is a local maximum of f(x) since in this case 
there exists a neighbourhood of хо such that at all points belonging to this 
neighbourhood which are on the left and on the right of xo the values of 
f(x) do not exceed /(хо). However, the function f(x) can attain its absolute 
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maximum M at the endpoints of [a, b]. So, if we wish to find an absolute 
maximum of a function f(x) continuous on a closed interval (0, b] we have 
to find all maxima of f(x) in an open interval (a, b) and the values of 
f(x) at the endpoints of [a, b], i.e, f(a) and f(b), and choose the greatest 
value as the absolute maximum of f(x) on [a, b]. 

The absolute minimum of a function f(x) continuous on [a, 5] is the 
least value of f(x) among all the minima of f(x) in (a, b) and the values 
of f(a) and f(b). In the case of a function shown in Fig. 8.37 we have 
M = f(b) and m = f(xo). 

Example. Suppose that we have a square sheet of steel whose side is 
a and we wish to make a box of maximal volume by cutting out four equal 
squares at the vertices of the given sheet as shown in Fig. 8.38 and flanging 
the sheet. How should we choose the size of the cut out squares to get 
a box of maximal volume? 








Fig. 8.38 


9 The volume of a box being a function of x is given by 


u(x) = x(a — 2x)’, 0<х< 


NIR 


The first derivative of v(x) is 


È = (a — 2x)? - 4x(a — 2x) = (a — 2a — 6x) 
so that the critical points of v(x) are x1 = a/6 and x = а/2. 
The open interval (0, a/2) contains the critical point xı = a/6. 


The second derivative of u(x) is 


а? 
di = —2(a — 6x) – 6(a — 2x). 
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2 
We have T <Oatx= A so that at this point the function v(x) has 
x 


the maximum 


v a 271. 2) _2@ 
6 6 \3 7277 


At the endpoints of (0, 2/2] we have v(0) = v(a/2) = 0. 

Therefore, we get the maximum of v(x), ie., the box of maximal volume, 
by cutting out four equal squares with side x = 2/6 from the given sheet 
of steel. In this case the maximal volume of the box is 247/27. > 


8.  Investigating the Shape of a Curve. 
Points of Inflection 


Convexity of a curve and points of inflection. Let a curve be speci- 
fied by a function у = f(x) and let y = f(x) have a finite derivative f" (xo) 
at a point xo so that at the point Mo(xo, f(xo)) of the curve there exists 
a tangent which is not parallel to the y-axis. 


Хо-д Ig 





Fig. 8.39 


We say that a curve is convex downward at a point Мо if there exists 
a neighbourhood (xo — 6, xo + 5) of a point xo such that all points of the 
curve with abscissas contained in (xo — ô, xo + ô) lie above the tangent to 
this curve at Mo (Fig. 8.39). 

A curve is said to be convex upward at a point Mo if all points of this 
curve with abscissas in some neighbourhood of хо lie below the tangent 
to the curve at Mo (Fig. 8.40). . 
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Let y = f(x) be a function differentiable on an open interval (a, b). We 
say that the graph of y = f(x) is convex upward (downward) on (a, b) if 


the graph does not lie above (below) the tangent to y — f(x) for any x in 
(a, b). 











Fig. 8.40 








Fig. 8.41 


The point Mo(xo, f(xo)) is called the point of inflection of a curve 
y = f(x) if there exists a neighbourhood (xo — ô, xo + 5) of хо such that 
for all x in (xo — 5, xo + ô) the curve is convex upward if x < xo and convex 
downward if x > xo or vice versa (Fig. 8.41). In other words, the point Mo 
is the point of inflection if on the left and on the right of Mo the curve 
lies on different sides of the tangent at Mo. 
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Now we shall consider an analytical procedure for investigating the 
sense of convexity and points of inflection of a curve. 

Let us choose a point on a curve y = f(x) and a point on the tangent 
to y = f(x) at Mo(xo, f(xo)). Suppose that these points have the same abscis- 
sa x and y is the ordinate of the point chosen on the curve while Y is the 
ordinate of the point on the tangent (Fig. 8.42). Evidently, if y - Y > 0 
for all x # xo in a sufficiently small neighbourhood of хо the curve is convex 
downward at Mo and if y — Y « 0 for all x z xo in a sufficiently small 
neighbourhood of хо the curve is convex upward at Mo. Hence, to determine 
the sense of convexity of a curve at Mo it suffices to investigate the sign 
of the difference y — Y in a neighbourhood of хо. 


y 











Fig. 8.42 


Since the equation of the tangent to the curve is 
Y — Јо) = /' (xo)(x — xo) 
we have 
y — Y= f(x) — [fGxo) + /' (xo) — xo)l 
= [f – fo] — /' eo) — xo) 


Or 
A=y- Y= [Ло + А) — f(xo)) — f' Оо, 
where Л = x — xo. 
Let f(x) have the second derivative f” (x) at xo and in some neighbour- 
hood of xo. Applying the mean value theorem (Theorem 8.7) to the above 
identity, we get 


A = f' (xo + Oh)h — f' (xo)h = [ f" (xo + 8h) — f’ (xo)lh, 
where 0 = O(h) and 0 < 0 < 1. 
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Since f" (x) exists at xo then 
lim /' (Xo + Ax) — f' (xo) 








am Ax = f" (xo) 
so that 

tim 7 бө 5009-7069. pe (x5) 

6h 0 0h 


and 
УТ (xo + 0h) — f' (xo) = f" (xo) 8h + a(8h)-0h, 


where a (0h) > 0 as h > 0. 
Then we can write 


A-y-Y-[f"(x) + a(00)0n. 





y 
у=х* 
+ 
flx >20 ==> Mol Xo f(x) 
М,(Хо/(хо)) 
ffxo)«0 => > 
Fig. 8.43 Fig. 8.44 


Let f” (хо) # 0. Since o(0A) is an infinitesimal as h > 0 there exists 
$ > 0 such that f” (xo) + a(8h) is of the same sign as f” (xo) and ӨЛ? > 0 
whenever 0 < th! < 6. Hence, if f" (хо) > Othen y — Y > 0 for all x suffi- 
ciently near to xo and the curve y = f(x) is convex downward at the point 
Mo(xo, f(xo)) and if f" (х) < 0 then у = f(x) is convex upward at Mo 
(Fig. 8.43). Whence follows the necessary condition for a point to be a point 
of inflection: the point Mo(xo, f(xo)) may be the point of inflection of the 
curve y = f(x) only if f" (хо) = 0 or if f" (xo) does not exist. 
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This condition is not a sufficient one. For example, if f(x) = x* then 
f" (x) = 12x? and f" (0) = 0; however, the point O(0, 0) is not a point of 
inflection of y — x*: at this point the curve is convex downward (Fig. 8.44). 

The following theorem gives the sufficient condition for a point to be 
a point of inflection of a curve. 

Theorem 8.19. Let f(x) have the second derivative in a neighbourhood 
of xo and let the second derivative of f(x) be continuous at xo. Then the 
point Mo(xo, f(xo)) is a point of inflection of the curve y = f(x) if 
Jf" (xo) = 0 and f" (x) changes its sign at хо when x passes through xo. 
“4 Indeed, let f" (хо) = 0 and let there exist a neighbourhood (xo — ô, 
Xo + 6) of xo such that the sign of f" (x) for all x « xo differs from that 
for all x > хо. This means that the sense of convexity changes at 
Mo(xo, f(xo)) so that Mo is a point of inflection. 

Suppose now that f" (xo) = 0 and there exists some neighbourhood of 
Xo such that f" (x) is of the same sign for all x « xo and all x > xo. Then 
Mo is not a point of inflection since the curve is convex downward at Mo 
if f" (x) > 0 both on the left and on the right of xo and the curve is convex 
upward at Мо if f" (x) < 0 both on the left and on the right of хо. » 

Problem. Let f(x) have a finite third derivative at xo and let f” (xo) = 0 
and f" (хо) # 0. Prove that Mo(xo, f(%o)) is a point of inflection of the 
graph of f(x). 

Sometimes it happens that at the point of inflection Mo(xo, f(xo)) the 
tangent to the curve y = f(x) is vertical, i.e., parallel to the y-axis, so that 
f" (x) does not exist at хо. For example, if f(x) = x"? then f’ (х) = 4 x 4m 


E ЖЕ. 

and f" (x) = 9 ans 
at x = 0 f” (x) does not exist. Let us investigate the sign of f” (x) in a neigh- 
bourhood of x = 0. We have f" (x) > 0 for all x in (— ё, 0), 6 > 0, and 
Jf" (x) < 0 for all x in (0, 8) so that the curve is convex downward on the 
left of the point O(0, 0) and convex upward on the right of O(0, 0). Hence, 
the point О(0, 0) is the point of inflection of y = x'”?. The tangent to 
this curve at O(0, 0) is perpendicular to the x-axis, for lim УЛ (х) = + оо, 

xm 


Clearly, there exist no points where f” (х) = 0 and 


Finally, we can state the following sufficient condition for a point to 
be a point of inflection. Let y = f(x) have a tangent at Мо(хо, /(Хо)) which 
can be parallel to the y-axis. Let f(x) have the second derivative continuous 
in a neighbourhood of xo except probably at xo. If f" (x) is equal to zero 
or does not exist at xo and if f" (x) changes its sign when x passes through 
хо then Мо(хо, f(xo)) is the point of inflection of у = f(x). 
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8.10 Asymptotes of a Curve 


Consider a curve having an infinite branch (Fig. 8.45). An asymp- 
tote of an infinite branch of a curve is a line whose distance 6 to a point 
M on the curve tends to zero as M recedes infinitely distant from the origin 
of coordinates. 

Vertical asymptotes. A line passing parallel to the y-axis through the 
point x = xo on the x-axis is a vertical asymptote of the graph of y = f(x) 
if at least one of the identities 


lim f(x) = +0 or lim ДХ) = +оо 
x—xo—0 X—xo*0 


holds. 








Fig. 8.45 


Clearly in this case the distance 5 = |х — xol between the point 
M(x, f(x)) on the graph of y = f(x) and the line x = xo tends to zero and 
M recedes infinitely distant from the origin of coordinates. 

For example, the graph of y = 1/x has the vertical asymptote x = 0 since 

lim Le —o and lim en + co (Fig. 8.46). Similarly, the graph of 
x20-0X x20«0X 


y =e!” has the vertical asymptote x = 0 for іт e! = + ә. 
x2040 


Figure 8.47 illustrates mutual positions of a curve and its vertical 
asymptotes. 

To find vertical asymptotes of a curve y — f(x) one should proceed as 
follows: 

(1) Find discontinuities of f(x) on the x-axis. 

(2) Isolate all those discontinuities where at least one of the one-sided 
limits of f(x) is equal to + оо or — оо, Let these be x1, Хо, ..., Xm. Then 
the lines x = x1, X = X2, ..., X = Xm will be the vertical asymptotes of the 


368 8. Differential Calculus. Functions of One Variable 




















Fig. 8.46 
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Fig. 8.47 


8.10 Asymptotes of a Curve 369 


graph of y — f(x). For example, the vertical asymptotes of the graph of 





the curve y — zd 1 are the lines x = —1 and x = 1 (Fig. 8.48). 
x 


The vertical line x = xo (xo being the endpoint of the interval where 
f(x) is defined) is the vertical asymptote of y = f(x) provided that (i) xo 
is the left end of the interval and 


lim Д) = +% or lim ДХ) = -æ 


х» хо+ 0 x>x +0 
or (ii) xo is the right end of the interval and 


lim f(x) +æ or lim JO = — 00, 


хэ хо 0 эхо – 











Fig. 8.48 


For example, the function y = In x is defined on the interval 0 « x « 


+œ and lim Inx = -œ so that the line x = 0, i.e., the y-axis, is the 
x70+0 


vertical asymptote of the graph of y = In x. 

Oblique asymptotes. Let y = f(x) be defined for all x > a (x < a). Sup- 
pose that the line y = kx + b is an asymptote of the graph of y = f(x). 
Then the asymptote y = kx + b is called the oblique asymptote. 


24—9505 
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For definiteness we consider whatever large positive x. If the line 
y = kx + b is an asymptote of у = f(x) then, by definition, the distance 
6 from the point M(x, f(x)) on the curve у = f(x) to the line y = kx + b 
tends to zero as x > + со. Let a be an angle (o x 7/2) made by the asymp- 
tote with the x-axis (Fig. 8.49). Evidently, 6 = |[MNIcos о. Since cos а # 0 
then |IMNI tends to zero as ô tends to zero as x ^ + оо and vice versa. 
Noting that IMNI = | f(x) — kx — bl we infer that the line y = kx + b 
is an oblique asymptote of the graph of y = f(x) if and only if 

lim (f(x) - kx- b) 20 
x> +o 


so that f(x) admits a representation of the form 
Лх) = kx + b + а(х), 
where lim a(x) = 0. 


x> +0 





Fig. 8.49 


Observe that if there exists an asymptote y = kx + b of the curve 
y = х) as x ^ + оо then the function y = f(x) is “nearly” a linear func- 
tion as x > + œ, i.e., y = f(x) differs from y = kx + b by an infinitesimal 
as x — + о. - 

Theorem 8.20. For the graph of the function у = f(x) to have the ob- 
lique asymptote y = kx + b as x ^ +œ it is necessary and sufficient that 
there exist both 

lim 9 = к and lim [fW — kx] = b. 
хэ. X x o 
«8 Necessity. Let the graph of y = f(x) have an asymptote y = kx + b as 
X — + оо so that 
Лх) = kx + b + a(x), 
where а(х) ^ 0 as x 2 + о. 
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Then 


хә + xoxo 


lim Æ = lim [+20] = к 
x x x 
and 
lim (/0)-41- lim [b+aQ1=8 


i.e., both limits we are seeking for do exist. 
Sufficiency. Let both limits mentioned in Theorem 8.20 exist. Since 
lim [f(x) — kx] = b the difference f(x) — kx — b becomes an in- 
X Ro 


finitesimal as x ^ + œ. Denote this difference by a(x). Then we can write 
f(x) = kx + b + а(х) where a(x) > 0 as x ^ + о. Whence it follows that 
the graph of the function y = f(x) has the oblique asymptote y = 
kx + Б. > 

The proof is similar if x ^ — oo. 








Fig. 8.50 


x2 


Example. Prove that the graph of y = x 





Ї has an oblique asymptote. 





2 
x i has the vertical asymptote x = 1 (Fig. 8.50). 


«8 The graph of y = E 


where 





2 
EORNM Е ee 


Let us write this function if f(x) = i aA 


24* 
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1 
x- 1 
be expressed as 


Д(х) =x + 14 а(х), 


tends to zero as x > со. Hence, the function being considered can 





where a(x) = 70 as x — oo. 


2 
Whence we conclude that the graph of the function y — ҮЛ has the 


oblique asymptote y = x + 1. » 
It is worth noting that the graph of y — f(x) lies below its asymptote 
if A = f(x) — kx — b < 0 and the graph lies above the asymptote if A > 0. 


Horizontal asymptotes. If the function f(x) has the finite limit 
lim Д) -5( lim f(x) = b) as x ^ +æ (x – оо) then the line 


хә +0 
у = Б is the horizontal asymptote of the right (left) branch of the graph 
of y = f(x). 

Clearly, a horizontal asymptote is a specific case of an oblique asymp- 
tote provided that k = 0. 











Fig. 8.51 


Examples. (1) Let y = 1/x. Since lim 1/x = 0 the graph of the function 


y = 1/x has the horizontal asymptote у = 0. 
(2) Let y = tan! x so that lim tan ! x = я/2 апа lim tan !x-— 


I> + хә – = 


— 1/2. Hence, the right-hand branch of the graph of у = tan! x has the 
asymptote y = 7/2 and the left-hand branch has the asymptote y = — 1/2 


(Fig. 8.51). 
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(3) Let y = mx and y(0) = 1. Since lim mx = 0 the line y = 0 is the 


о 


horizontal asymptote of the graph of the function у = mx (Fig. 8.52). 


As easily follows from the above example the graph of the function 
y = f(x) can intersect its asymptote infinitely many times. 

Problem. Derive the condition for asymptotes of the graph of the rational 
function to exist. 








Fig. 8.52 


8.11 Curve Sketching 


In general the process of investigating a function and sketching its 
graph involves a sequence of steps which enable us to determine the basic 
properties of the function being considered. These are: 

(a) The domain of definition of the function. 

(b) The discontinuities of the function and their kinds. The vertical 
asymptotes. 

(c) The point symmetry and the translation symmetry, i.e., whether the 
function is even or odd and whether the function is periodic or not. 

(d) The points of intersection of the graph with the axes of coordinates. 

(e) The behaviour of the function at infinity and the horizontal and 
oblique asymptotes. | 

(f) The intervals where the function is monotone and the points where 
the function has extrema. 

(g) The intervals where the graph is convex downward and is convex 
upward and the points of inflection. 

Based on the results of the steps (a)-(g) we can easily plot the graph 
of the function. To illustrate our approach to curve sketching we consider 
the following examples of investigating functions and constructing their 
graphs. 

1 
1+ x? 
(a) The domain of definition is the number line. 

(b) There exist no discontinuities and no vertical asymptotes. 


Examples. (1) y = (Witch of Agnesi). 
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(c) The function is even, i.e., f( — х) = f(x) so that the graph is symmetric 
relative to the y-axis. The function is not periodic. Since the function is 
even it suffices to construct the graph for x > 0 and reflect this graph with 
respect to the y-axis. 

(d) The graph lies above the x-axis since y = 1 at x = 0 and y > 0 for 
all x; y # 0.- 

(e) The graph has the horizontal asymptote y = 0 since lim f(x)- 

X x00 


i ao = 0. There are no oblique asymptotes. 
I> +оо 
(f) The first derivative of y is y' = — 22282, 
(i + x^) 
for x < 0 and decreases for x > 0. The point x = 0 is critical. The derivative 
y’ changes its sign from negative to positive when x moves through the 
point x = 0 from left to right. Hence, the function has the maximum 
y(0) = 1 at x = 0. This almost immediately follows from the inequality 


Хо) = 





so that f(x) increases 





7 <1, holding for all x. 

2 
EE 
(1 + x^) 
x = 1/V3 and x = —1/V3. The graph is convex downward for x > 1/V3 and 
convex upward for x < 1/У3 since у” > 0 in the former case and y" < 0 
in the latter. Hence, x = 1/У3 = V3/3 is the point of inflection. By virtue 
of symmetry the point x = —1/V3 = — v3/3 is also the point of inflection. 

It is convenient to represent the results in the tabular form 


(g) The second derivative y" = — vanishes at the points 


Table 8.1 








The arrows 7 and N refer to the increase and decrease of the function, 
respectively. The graph of the function is given in Fig. 8.53. 

ox 1 

(2) у= х + x 


(a) The domain is the number line except the point x = 0. 
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(b) The discontinuity is the point x = 0. We have 


lim (2 +1) = +œ and 


x70+0 


lim (+4) = -o 
x70-0 x 


so that the line x = 0 is the vertical asymptote of the graph. 





Fig. 8.53 


(c) The function is neither even nor odd and it is not periodic. 
3 
x +1 








(d) Putting y = 0, we get x? +1 = 0 ог = 0; whence x = -1 
so that the graph of the function meets the x-axis at the point x = -1. 

(e) There are neither oblique nor horizontal asymptotes since 

lim LO) _ lim (=+) = +, 
хэс X хо x x 
Р не. 1 28-1 : 
(f) The first derivative is у’ = 2x - — = ;— so that the point 
x x 


І 
v2 


at the point x = 5 ; hence, at this point the function has the minimum. 
2 


x= is the critical point. The second derivative y” = 2 + 2 is positive 
x 


3 
(g) The second derivative y" — 25-01 
х 


vanishes at x = —1 and 
changes the sign from positive to negative when x passes through the point 
x = —1 from left to right. Thus the point x = —1 is the point of inflection. 
Since y" > 0 for x in (—«, —1) and for x in (0, +) and y" < 0 on 
the interval —1 « x « 0 the graph is convex downward on the open inter- 
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vals (— o», —1) and (0, + о) and is convex upward on the open interval 
-1« х<0. 

The graph of the function is given in Fig. 8.54. 

@)yax4 BE. 

(a) The domain is the positive number line x > 0. 

(b) There are no discontinuities in the domain of definition. 





Fig. 8.54 


We have lim (х + hx) _2 so that the line x = 0 passing 
х-0-0 х 


through the endpoint of the domain of definition is the vertical asymptote 
of the graph of the function. 
(c) The function is neither even nor odd nor periodic. 


44) Putting y = 0, we have x + эх = 0 and x? + In x = 0. The solu- 


tions of these equations which can be obtained from the graph shown in 
Fig. 8.55 define the point of intersection of the graph under investigation 
with the x-axis. 
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(e) Since 
іт 709. lim (1+ 53) =1=k 
x> +0 x X +w x 
and 
lim 1/9-Ю1- lim 2% =o, 
X +00 хэжф X 


the line y = x is the oblique asymptote of the graph of the function. 








Fig. 8.55 


2 — 
(f) The first derivative 15 y' = 1 + L — ш х шон = In x 
x x x 
easily see from Fig. 8.56 that x? + 1 » In x for all x > 0, Hence, y’ > 0 
for all x and f(x) increases on the interval (0, + ©) and has no extrema. 


2 1 2Inx _2inx -3 


(g) The second derivative y" = — 5 —-— + 1 : 
x x x x 


ishes at the point x = e? and changes the sign from negative to positive 
when x passes through х = e? from left to right. Hence, x = e° is the 
point of inflection. 

The graph of the function is shown in Fig. 8.57. 


. We сап 








van- 


(4) yaxt4. 
x 


(a) The domain is the number line except the point x = 0. 
(b) The point x = 0 is the discontinuity of the second kind. The line 
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Fig. 8.56 





Fig. 8.57 
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x = 0 is the vertical asymptote of the graph of the function, for 
lm (x+ 3) = +0, 
x70+0 x 
(c) The function is neither even nor odd nor periodic. 
(d) Putting y = 0, we have x? + 1 = 0; whence x = —1 so that the 
graph of the function intersects the x-axis at the point x = —1. 
(e) The graph has the oblique asymptote y = x since 


tim 709. tim (1+5) -1-6 
x 





х-жю X Xx xo 
and 
lim [fo)-x= lim L-0-5 
хә xo хэ xo X 
: Y 2 x -2 : 3 
(f) The equation у’ = 1 — a = P = 0 gives x” — 2 = 0 so that 


Im. ЭС : : 
x — V2 is the critical point of the function. 








Fig. 8.58 


The second derivative y" = 6/x* is positive everywhere in the domain 
of definition of the function and, consequently, at x — V2. Whence it fol- 
lows that the function has the minimum at x — V2. 

(в) Since the second derivative y" = 6/x* is positive everywhere in the 
domain of definition the graph is convex downward. 

The graph of the function is given in Fig. 8.58. 
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(5) у = Ах - 3). 

(a) The domain is the number line. 

(b) The function is continuous everywhere in the domain of definition 
and its graph has no vertical asymptotes. 

(c) The function is nonperiodic and neither even nor odd. 

(d) The function vanishes at the points x = 0 and x = 3. 

(e) The graph of the function has the oblique asymptote y = x — 2 since 








3f 2 
lim цэн lim Nx 3» шү 


and 


lim [fG) -x] lim [Mx(x 3)2 - x] 
2 lm x(x-3)-x 


= —2 =b. 
хе My (x — 3) + xNx(x — 3)? + x? 











Fig. 8.59 


(f) The first derivative 


‚_ @= 39? £2x—-3) х—1 
y 3Ix(x — 3777 х273(х — 3) 
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vanishes at the point x = 1 and does not exist at the points x = 0 and 
x = 3. у’ is of the same sign on the left and on the right of the point 

= 0 so that the function has no extremum at x = 0. On the other hand 
the first derivative changes its sign from positive to negative when x passes 
through the point x = 1 from left to right so that the function has a maxi- 
mum at x = 1. 

At the point x = 3 the function has a minimum since the first derivative 
changes its sign from negative to positive when x passes through the point 
x — 3 from left to right. 

(g) The second derivative 


” 2 
SEN ZEE 


does not exist at the point x = 0 and changes its sign from positive to nega- 
tive when x passes through the point x = 0 from left to right so that the 
point x — 0 is the point of inflection and the graph of the function has 
the tangent parallel to the y-axis at (0, 0). The point x — 3 is not the point 
of inflection. The graph of the function, shown in Fig. 8.59, is convex up- 
ward in the half-plane x > 0. 


8.12 Approximate Solution of Equations 


We wish to compute the real root of the equation 
70) = 0. 


Suppose that the following conditions are satisfied: 

(1) the function f(x) is continuous on a closed interval [a, 5]; 

(2) the values f(a) and f(b) are of opposite signs so that f(a); f(b) < 0; 

(3) there exist derivatives f’ (x) and f” (x) on [a, b] retaining their signs 
on [a, 5]. 

By virtue of conditions (1)-(2), Theorem 7.27 says that f(x) vanishes 
at least at one point £ in (a, b); this means that the equation being consid- 
ered has at least one real root £ in (a, b). Furthermore, since f'(x) retains 
its sign on [a, b] the function f(x) is monotone on [a, b] so that the equation 
has the unique real root £ in (a, b). 

We shall consider an iterative method of computing the unique real root 
of the equation f(x) = 0 with any predetermined precision. 

Four cases have to be distinguished for [a, 5]: 

(а) f' (x) > 0 and f” (х) > 0; 

(b) Р(х) > 0 and f” (x) < 0; 

(c) f’ (x) < 0 and /”(х) > 0; 

(d) f' (x) < 0 and f" (х) < 0. 
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The graphs corresponding to cases (a)-(d) are given in Fig. 8.60. 

Let us turn our attention to the case (a) (Fig. 8.61). We have f' (x) > 0 
and f" (х) > 0 on [a, b]. Draw the secant through the points A(a, /(а)) 
and B(b, f(b)). The equation of the secant is 


У-Ла) х-а 





fb -fa b-a' 





(с) (а) 
Fig. 8.60 


The point а, at which the secant AB intersects the x-axis lies between 
a and £, being a better approximation to £ than a. Substituting y = 0 into 
the equation of the secant AB, we easily obtain 


_ Ло) — а) 
f(b) – f(a) ` 
Look at Fig. 8.59. It is easy to observe that at a; the signs of f(x) and f" (x) 
are opposite. 
Now we draw the tangent to the curve y = f(x) at the point B(b, f(b)). 
Notice that at this point f(x) and f” (х) are of the same sign; this is a very 
important condition: if f(x) and f” (x) are of opposite signs at B(b, f(b)) 


а = а 
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1 


it can happen that the process of approximating the root diverges. The 
point b, of intersection of the tangent with the x-axis lies between b and 
Ё and offers a better approximation to £ than b. The tangent is given by 


the equation 
y — f(b) = f' (Бух – b). 
Then putting y = 0, we get 





-b - Ў) ; 
bi-b f) Cf* (b) = 0). 


Therefore we һауе a < а < Ẹ < bı < b. 


й 








Fig. 8.61 


Suppose that the absolute error of the approximation £* to £ is preas- 
signed. We can estimate the precision of the approximate values a; and 
bı by comparing the value |b: — ail with the preassigned error. If 
Ib; — ail is greater than the preassigned error then we choose the closed 
interval (ат, bı] and repeat the process by computing the new approximate 
values for the root £ as 


. Ха), - a) 
fb) - f(a) 


0 = а 


апа 





_ p ЛДЫ) ; 
b2 = bı Te) (Л Фу) # 0), 


where a < ai < ma < £ < bı < bı < b. 
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Repeating the process, we finally obtain two sequences of the approxi- 
mate values for the root 


а<а1< 0 <... «<а <... <E 





апа 
р> р> > ... > Б> ...> Ё, 
where 
2 | (an - ))(bs - 1 — da - 1) 
Tc T S bec) d sik. ' 
ЛЬ, - 1) 
bn = В, = L——— = а, bb = Б, п = 1, 2, 3, ...). 
1 7 (8-1 (do = a, bo n ) 


The sequences {an} and {b,}] are monotone and bounded; hence they 

have limits. Let 
lim а, = а and lim db, = f. 

It is not hard to verify that a = 8 = £ where £ is the unique real root of 
the equation f(x) = 0 provided that the conditions (1)-(3) hold. 

Example. Compute the root £ of the equation x? — 1 = 0 on the closed 
interval [0, 2]. 
<~ Evidently, £ = 1. Let us apply the process outlined above. For the func- 
tion f(x) = x? — 1 the following conditions are satisfied: 

(1) f(x) is continuous on [0, 2]; 

(2) f(0) = –1 «0 and f(2) = 3 > 0 so that f(0)-/(2) < 0; 

(3) f' (x) = 2x and f” (x) = 2 retain their signs on (0, 2]. 

Thus there exists the unique root £ of the equation x? — 1 = 0 on 
[0, 2]. Consequently, the process is applicable in this case. 

We have a = 0 and b = 2. For n = 1 the recurrence formulas for an 
and b, give 





29-.CD2.1., 1 
шан зэ its 
3 y. 1 
Бүхээг ээд ЛЭ 
For n = 2 we get 
Sd Sia 
a = 1 14 апа b-ltag 


which are the approximate values for £ with the absolute error A(E*) < 
01. » ? 


8.13 Taylor's Theorem 


Taylor's formula for polynomials. Consider the nth degree poly- 
nomial 


Р(х) = bo + bix + box? +... + bax” (bs x 0), 


where bo, bi, b2, ..., b, are constant coefficients. 

We can express P(x) as the expansion in powers of x — a, with some 
coefficients, where a is an arbitrary number. Indeed, put x = a + t. Then 
P(x) becomes 


Р(х) = P(a + t) = bs + bi(a + D +... + brila + 1)". 
On opening the brackets and collecting the similar terms, we can write 
P(a + t) = Ao + Ait + Аі? + Аз? +... + Ant” 
or, substituting x — a for t, 
Р(х) = Ao + Ai(x — a) + Az(x — ay + As(x — ay 


+... + A,(x — а)", (*) 
where Ао, Aj, ..., An are coefficients dependent on the original coeffi- 
cients bo, bi, ..., bn. 


Differentiating Р(х) in (+) n times, we obtain 


P'(x) = Ai + 2Ax(x — a) + 3As(x – a)? + ... + nA«Q — ay |, 
P"(x) = 2х 142 + 3x 2A4s(x — a) 4 ...« n(n – DAs( — ay ?, 


Р(х) = n(n— 1) ... 2x lAn. 
Putting x = a in P(x) and in the above identities, we have 
P(a) = Ao, P'(a) = 11:А;, P"(a) = 2A», ..., P? (a) = n As. 
Whence 





_ _ P'(a) _ P" (a) _ P? (a) 
Ao = Р(а), Ace Аз = ap эээ е 
Therefore we can express Р(х) as 
ро) = Pla) + P (х - а) + 0 (х- а)? 
РЭ)а) n 
Tori t— & — a) : 


This is Taylor's formula, in powers of x — a, for the given polynomial Р(х) 
of degree п or Taylor's formula for P(x) at the point а. 


25—9505 
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If we put a = 0 we obtain the specific case of Taylor's formula 


PO, POs PONO) S 
5) un T СА 
which is called Maclaurin’s formula. 
Example. Expand the polynomial P(x) = x? — 3x + 2 in powers of x 
and in powers of x — 1. 
ча Applying Maclaurin's formula, we get 


P(x) = х? — 3x +2 > P(0) = 2, 
P’ (x) = 2x – 3 = Р'(0) = –3, 
Р"(х) = 2 э Р” (0) = 2, 


Р(х) = Р(0) + x+ 








and 


Р: 3 25a a 2 

Р) =2—т{ Х +5 = 2 – 3х + х“, 
so that the expansion of P(x) in powers of x is identical to P(x) itself. 
To expand P(x) in powers of x — 1 we apply Taylor's formula and get 


Р(х) = х? – 3х + 2 ә Р(1) = 0 
Р'(х) = 2x – 3 = Р'(1) = -1, 
Р" (х) = 2 = Р"(1) = 2 


апа 


Р) = 0- 0-0) +5 Z œ- P= -e-+ с 1). » 


Notice that Taylor's formula gives the value of P(x) at any point x 
provided that the values of P(x) and all its derivatives at some point a 
are known. 

Taylor's formula for arbitrary functions. Now we consider a function 
f(x) defined in a neighbourhood of x = a; the function f(x) may not be 
a polynomial of degree (n — 1) but is supposed to have derivatives up to 
order n in this neighbourhood. 1 

Let us compute the values f(a), f’ (а), ..., /" (а) and use them to 
construct the function 


n-1) 
Orie) = fla «e o +... +O 


n-1 


(x – а) 


Evidently, О» – 1(х) is Taylor’s polynomial of degree п — 1 for the function 
fo. If the original function f(x) were a polynomial of degree (п — 1) then 
we would have the identity f(x) = О, – (х) for all x in the neighbourhood 
in question. However, in the case under consideration the identity does not 
hold since, by the hypothesis, f(x) is not a polynomial of degree (n — 1). 
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Let us put 
Хо) = Qn - 10) + Ёл. 


This equality is called Taylor’s formula for the function f(x) in the neigh- 
bourhood of the point x = a or Zaylor's formula for the function f(x) at 
the point a, and R,(x) is called the nth remainder of Taylor's series. 

The remainder R,(x) can be expressed in terms of the nth derivative 
of the function f(x). For the purpose we assume that f(x) is not a polynomi- 
al of degree (n — 1) and possesses continuous derivatives up to order 
(n — 1) on [a, 5] and there exists the nth derivative of f(x) in (a, b). Sub- 
stituting b for x in Taylor's formula for the function f(x) at the point a, 
we can write 


fü) = fa) + O  - а +O (p - ay 


doe FORM 


(n — 1)! 
We shall represent R, in the form 
R, = M(b - a)", 


where M is the quantity to be defined. 
To this end we consider the auxiliary function 





ec = 700) - [ло + @- 9 +5 Ф 
$5 02—700 а" 


defined on [а, b]. The function ф(х) is obtained by replacing a by х in 
the right-hand side of f(b) and subtracting the right-hand side from f(b). 

Observe that ф(х) satisfies Rolle’s theorem (Theorem 8.6). Indeed, 

(a) ф(х) is continuous on [a, b] since the original function f(x) and 
all its derivatives up to order (n — 1) are continuous on [a, 5]; 

(b) (х) has a derivative on (a, Б) since the original function f(x) has 
the nth derivative on (a, b); 

(с) ф(х) assumes equal values at the endpoints of [a, 2]: computing 
ф(х) for x = a and x = b gives (a) = 0 and e(b) = 0. 

Then by virtue of Rolle's theorem there exists a point £ in (a, Р) such 
that e'(£) = 0. 

Computing the derivative o'(x), we get 


ees -[ro- m +00 - zs LO 20 - 4 


25* 


388 8. Differential Calculus. Functions of One Variable 


E + yc 6.2) (x) (b — x)" a ie (x) (n – D(5 22 хуу? 





(1-2) (0-1) 
( ma юэ 
элсэн ! - Mn(b - x) | 
so that 
p'o = -£ A; (b — х)" + Mn(b - x^. 
Therefore 
«(= -(- 8)" 7! ES- mn| = 0. 
Since £ z b (the point £ lies inside (a, b)) then 
LO! = 
esi — Мп = 0 
and 
"ELS 
n! 


Hence we can write А, = M(b — a)" as 
n) 
R2 b-a" «ёо 
Substituting this expression into the formula for f(b), we obtain 


fb) = fla) 7 9 (p - a) +5 o- ay 


Lime с LO q а)". 


(n – 1)! 
This is Taylor's formula for the function f(x) and 
R, -LO o- ay, 


where £ lies Paces a and b and is the nth remainder as given by Lagrange. 
If we put п = 1 Taylor's formula for f(x) becomes as given by Theo- 
rem 8.7 


f) = fa) + © (b — а) 


Or 


Fb) — fla) = f' (906 — a). 
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Taylor's formula remains valid for any points xo and x in [a, Б] so that 
we can write it in the form 


Үд бо) 





Ло) = fon) + (x — x) + a боож 


тоо СЯ 
des Se twp &- 0) 1+ Rœ, 


n) 
where R(X) = P (x — xo)" and the point £ lies between x and xo, or 


Jf (xy + 28 - xo) 





RO) = (х = ж)" (00<0 <). 


Putting хо = 0, we arrive at Maclaurin's formula 


ед = (0) +O x EO о... + FO vr! Ru), 


n) 
where К„(х) = fe) x, 0<0<1. 


The remainder in Taylor’s formula can also be represented in a form 
given by Peano. We have assumed that f(x) has the nth derivative f(x) 
in a neighbourhood of хо. Now we suppose that (x) is continuous at 
Xo. Then f? (xo + 8(x — җ)) = f? (xo) + а(х) where a(x) ^ 0 as x — хо 
and the nth remainder 


/ (xo + 8(x — xo)) 





Ré) = a (x — хо)" 
can be written as 
Р, (х) = оар (х — xo)" 
Or 
п) = n 
во) = 7-@® к ду + SHER ж", 


where а(х) ^ 0 as x хо. 


a (x)(x — xo)" 


P = o((x — xo)") as x ^ xo 


Since a(x) > 0 as x > xo then 
and the Taylor’s formula becomes 


200 








п) 
709 = f(xo) + (x — x) +... y (x — xo)" 


+ ж. as x хо. 
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This formula is sometimes called 7aylor's formula for the function f(x) 
with the remainder as given by Peano. It is easy to notice that the error 
of the approximation of f(x) by Taylor's formula is an infinitesimal of 
higher order than (x — xo)" as x хо. Hence, this formula is suitable to 
apply if we wish to approximate f(x) at points sufficiently close to xo. So 
it is sometimes called the /оса/ Taylor's formula. 

Maclaurin's formulas for some elementary functions. We shall apply 
Maclaurin's formula 

'(0 " (0 2 n - 1) 0 БЯ 
fo = fo) LOL DIO, DI үн 


n) 
+O) ы, 0<6<1 


to get the approximation of some elementary functions. 
Examples. (1) The function f(x) = е“. 


-« We have 
Ло) = е, ЛО) = 1, 
Го) = е", J’) = 1, 


fe Зо) = е 79-00) = 1, 
f? (x) = е^, / (6x) = e^. 


Applying Maclaurin's formula, we obtain 


2 л-1 Ox 
а ee Л ИЕ ЖЕ сла сар . 
e ker oS tn- Don 0<0<1 
Putting x = 1 gives 
а lun e cee 0<0 <1. 
2! (n—-1)! nt?’ 
Since 
0 
(ERE LUE 
n! n! 
: 1 1 1 : 
the polynomial 2 + at art 22 За дэлгүүр ТОЛЕ the number e 


by defect and with the approximation error less than 2. » 


(2) The sine function f(x) = sin x. 
ча We have 


f(x) = sin x, /00) = 0, 

f' (x) = cos x, f'(0) = 1, 
РО) = -sinx, f"(0)-0, 
J” œ) = -cosx, f” (0) = -1, 
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and in general 
ff? (x) = sin (х + т 5) . 


Whence 


т) (0) = : Л _ 0 for m = 2k, 
0ши 5 8 for m=2k+1, 


and 
f (0x) = sin (o +n 5) 


Therefore, the terms with even powers of х vanish so that Taylor's polynomi- 
al involving (2n + 1) terms is identically equal to that involving (25 + 2) 
terms. Then applying Maclaurin's formula and putting п = 2k + 1, we 
obtain 


TUNE DN ONE кей xs 
sinx= т ap tg tee +(—1) Qr- iy t Ree+ 10), 
where 
R xe" оп [0+ 06+ n7], 0«0«1 
меод = gps gr sin [Oe + + ЫН « < 1. 
Clearly, 1 pe ee 
early, 2k 4 1(X) «0611 
(3) The cosine function f(x) - cos x. 


ча We have 
Дх) = cos х, ЛО) = 1, 
f'G)--sinx f'(0) = 0, 
f’@) = -cosx, f"(0)= -1 


and in general 


f? (x) = cos (х + т 3) 


50 that 


0 for m=2k +1, 


my = un 
£0) = ооз ms pee fo т = 2k. 


Applying Maclaurin’s formula for n = 2k + 2, we get 


2 4 6 
x x 
cos x = 1 — ы a у у че и 


a 6! + Rak «29, 


бе 
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where 
yr 2 
Rak 2(X) = BaD! cos |^ + (2k + 2) 2). 0<0<1. 
26+ 2 
Evidently, | Ак +2(0)1 < (06-21) 


Maclaurin's formulas for sin x and cos x are suitable when we wish to 
approximate these functions with any predetermined errors. The Taylor ap- 
proximations to the functions sin x and cos x in a neighbourhood of x = 0 
are shown in Fig. 8.62 and in Fig. 8.63. 






хі, x^ 
EET 











Fig. 8.62 Fig. 8.63 


(4) The logarithmic function f(x) = ln(1 + x). 
“4 This function is defined and differentiable infinitely many times for 
x> -1. 








We have 
f(x) = In(1 + x), ЛО) = In1 =0, 
f'& = т1-, 0) = 1, 
fo = – d /7@) = -1, 
7" 09 = 7 n | f" @ =2! 


Je 790) = (—1)'(п - 2)!, 


п) fox п+1 (n — 1)! n) = (трут (n — 1)! 
Posy G, юа) = (- GP. 
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Using Maclaurin's formula, we obtain 


n-1 
fees р + Каб), 





2 3 
Ind +xy=7-545- 2.40 


1 


where 
_ (211 1 y^ 
Rn(x) = (-1) nd my 0<0<1. > 
(5) The power function f(x) = (1 + х)“ (о is real, x > —1). 
<4 We have 
Дох) = 0 + х)", ХО) = 1, 
ГО) =al + x77), 7 (0) = о; 
f" = ala — р + 3*7, f" (0) = ala — 1); 


SE?) = ale – 1)... (а - п + 2)(1 + х) 7771, 
f°") = о(а – 1) ... (а -n + 2); 

SOW = a(a — 1)... («а -n+ Dü + х)", 
S (6x) = ala — 1)... (@— n4 D + 0)". 











Whence 
"T a ala — 1) 2 
(1 + x) —-lbtdpxt ay x 
+... 426 NS эж п 2) уп-а 4 RO, 


where 





R09 = 2 - D---(a-n* Da а-па 001, » 
n! 


It is easy to observe that for natural a = т Maclaurin's formula for 
(1 + x)* becomes the binomial formula 


. +x" эх. 





(1 +0)" = 1+ ya fun —D x... 
1! 2! 

Maclaurin's formula and equivalent infinitesimals. We have established 
equivalence relations for infinitesimals in Chap. 7. These formulas are 
suitable to represent elementary functions for sufficiently small !x! and 
to evaluate limits of these functions. However, it is not a rare case when 
the limit in question depends on higher degrees of x than those involved 
in equivalence relations previously obtained. In this case we need to extend 
these equivalence relations to terms involving higher degrees of x to be able 
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to compute the limits of functions as x > 0. These extensions can easily 
be obtained by applying Maclaurin’s formula with remainder specified by 
Peano. The following is the list of the equivalence relations thus obtained: 








2 n 
x x 5 
-148Х458-49790079, x0; 
x XE 2n + 2 
1 Сэ ЭГ Е п п . 
sin x = x ata —...+(—1) Qna pi + ), x26 
Хо x* 2n 1 
esx-l-» t4 -e tCD Qn D tee ) x26 
Bebe e +D"! E + обуу x20 
5 3 282 " : : 
(3o 1+ ох OTD дз 
- 4, g(a — D ... (a — n £D үү ox”) x>0 
` п! i { 
Example. Evaluate lim *— эс X 


x0 


ча The equivalence relations given in Chap. 7 are of no value here since 
the principle term of the limit depends on x?. However using the equiva- 
lence relation for sin x given in the above list, we easily obtain 


x-( -3+ °®®) 
x-sinx - 3! 
= lim - 








lim ———,— = 
x70 x x20 Xx 
4 
= lim L 903 cb ox 
x20 X3! x 6 


Taylor’s theorem and investigating a function for extremum. Taylor’s 
formula is helpful to find extrema of a function. 

Theorem 8.21. Let a function f(x) have the nth derivative f? (x) in a 
neighbourhood of a point хо. Let f(x) be continuous at хо and let 
ST’ (xo) = f^ ош) =... = f" P(x) = 0 and f (xo) ¥ 0. Then (i) if n is 
odd f(x) has no extremum at хо; (ii) if n is even and f (хо) < 0 then f(x) 
has a maximum at хо and (iii) if n is even and f™ (xo) > 0 f(x) has a mini- 
mum at Xo. 
++ Observe that f(x) has an extremum at xo if there exists ё > 0 such that 
the difference f(x) — f(xo) retains its sign on the interval (xo — 6, xo + б). 
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Applying Taylor's formula, we have 








Ло) = foa) + EE c — xy) 47-09) (е wo)? 
JEP (xo) 22 “770 + 0(x — xo)) ü 
deno twp €» 1 E O (x хо, 
where 0 « 0 « 1. Whence 
fx) — f(x) = иш шы, - xy" (9) 
since f’ (хо) = f” (хо) =... = f" P (xo) = 0. 


By the hypothesis / (х) is continuous at хо and f? (xo) # 0. This im- 
plies (see Theorem 7.22) that there exists 6 > 0 such that at every point 
of (хо — ô, xo + 6) f(x) is of the same sign as f (xo). 

The following cases have to be distinguished: 

(а) n is even and f? (xy) > 0. Then 


f? (xo + 8(x — хо) 

n! 

so that (*) gives 
SO — f(x) 2 0. Vx€(xo — 8, xo + à). 


By the definition of a maximum of a function we infer that f(x) has 
a minimum at xo. 
(b) n is even and f(x) < 0. Then we have 


f? (Xo + 8(x — хо) 
n! 
and f(x) — f(xo) < 0 for all x € (xo — б, xo + 6) so that f(x) has a maximum 
at Xo. 
(c) n is odd and / (х) # 0. Then 
n) 
f (Xo E = Xo)) (x 23 Xo)" and / (xo) 
are of the same sign whenever x > хо and of opposite signs as x < хо. 
Hence, for any whatever small 6 > 0 the difference f(x) — f(xo) changes 
its sign on the interval (xo — à, xo + 5). This means that f(x) has no extre- 
mum at хо. 9» 
Example. Investigate the functions y = x* and y = x? for extremum. 
"4 It is easy to see that the point x = 0 is a critical point for these func- 
tions. The first three derivatives of y = x* are zeros at x = 0 and 
f°) = 24 > 0. Hence, n = 4 is even and f? (0) > 0 so that y = x* has 
a minimum at x = 0. 





(x — хо)" 20. Ухє (хо — à, xo + ô) 





(x — хо)" <0 Vx€(xo — 9, Xo + б) 
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The function y = x? has no extremum at x = 0 since its first two deriva- 
tives are zeros at x = 0 and the third derivative is not equal to zero so that 

= 3 is odd. » 

Remark. Taylor's formula is helpful to prove the following theorem 
which specifies the sufficient conditions for a function to have a point of 
inflection at xo. 

Theorem 8.22. Let f(x) have the nth derivative f(x) in a neighbour- 
hood of хо and let f? (x) be continuous at хо. Moreover, let f" (хо) = 
f" (xo) =... =f" "P (x) = 0 and f (xo) ж 0. Then Mo(xo, f(xo)) is the 
point of inflection of the graph of y = f(x) if n is odd. 

To verify this theorem it suffices to consider the function f(x) = x’. 


8.14 Vector Function of a Scalar Argument 


Definition. Suppose a point M moves along some trajectory L. We 
can locate this point at any time / by indicating its position vector г, its 
velocity vector v, its acceleration vector w, etc. Each of these vectors can 
be regarded as a vector function of a scalar argument /, ie, г = г(/), 

= v(f), w = w(t). 

We say that a vector function of a scalar argument / is defined on the 
interval (о, 8) and write a = а(/) if there is a certain law which assigns 
a definite vector а to each £ from (a, 8). 

Let a be expanded relative to the unit vectors i, j, k of some system 
of coordinates so that 


а= xi + yj + zk. 


If a = а(/) is a vector function of the argument : then the coordinates х, 
у, 2 of a = a(t) become scalar functions of the argument f, i.e., 


x= g(t), y=¥(0, z= 70, а <: < В. 


Conversely, if the coordinates x, y, z of the vector a are functions of the 
argument f so is the vector a, i.e., 


a = (fi + (Nj + Ok. 


Therefore the vector function a(f) is fully determined by the scalar func- 
tions x = (f, y = V(f) and z = y(f) and vice versa. 

The length and direction of the vector а(/) can in general be different 
for different f; also, а(/) сап be applied at different points for different 
t, for example as the velocity vector of a moving particle. 

Suppose now that the vector a(/) starts at a given poirt O in space. 
For different / а(/) ends at different points which form some set of points 
in this space. This set of points corresponding to all possible terminal points 
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of a(t) when ¢ varies over its domain of definition is called the hodograph 
of the vector function a(f). In general the hodograph of а(ї) is a curve 
which contains all points of the space where a(f) ends (Fig. 8.64). Observe 
that the hodograph of the position vector r of a moving point coincides 
with the trajectory L of this point. 

The equation 


г= (0), o«t«f 
ог 


r(t) = Hi + VOI + Ok 


is called the vector equation of the curve L. 

















Fig. 8.64 Fig. 8.65 


The equations 


x = e(t), 
y = №00), а<1< в 
z = ү(0), 


are called the parametric equations of the curve L. 
For example, the equations 


х= Rcost, 
y=Rsint, 0«1«2т (К, h are constant) 
12-11, 


are the parametric equations which describe one turn of a screw line (or 
a circular helix) in space (Fig. 8.65). 
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Limits and continuity of vector functions. Let a = а(/) be defined in 
a neighbourhood of a point £ = fo except probably at £ = £o. The constant 
vector A is called the /imit of the vector function а(/) as t > fo if, given 
any € > 0, there exists 6 > 0 such that 


la(f) - Al < ғ 


whenever If — fol < 6 and t z to. 
In symbols one writes 
lim a(t) = A. 
tto 
The geometric interpretation of the notion of a limit of a vector func- 
tion а(/) is that the length of the vector a(/) — A tends to zero as f > fo 


ө\ 


А 





Fig. 8.66 Fig. 8.67 


so that the vector а(/) tends to coincide with the vector A as t— to 
(Fig. 8.66). Hence 


(lim a(?) = A) e (lim la(f) – Al = 0). 
Let a(t) = e(Di + (0) + yk and A = ai + bj + ck. Then 
lay - Al = V(o() — a)? + WO — bY + YO — oy. 
Whence, if lim a( = A then 


>to 





lim e( =a, lim y() =b, lim y(=c 
1 to t> fo tto 
and vice versa. 

Let a vector function a(t) be defined on the interval a < t < 8 and 
to € (a, B). We say that the vector function а(/) is continuous at t = to if 


lim а(/) = a(to). 


I fo 
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Differentiation of vector functions. Let a = а(/) be a vector function 
defined on the interval a < ¢ < 8 and let a curve L be a hodograph of 
a(t). Then a certain value / € (a, 8) corresponds to the point M on L. Sup- 
pose that Aż is an arbitrary increment in ¢ such that ¢ + At € (a, 6). Then 
the vector a(/ + At) gives the point M, on L (Fig. 8.67). 
Consider the increment Aa in the vector function а(/) corresponding 
to the increment Af so that 


Aa = a(t + At) — a(t). 





The ratio 
Aa _ a(t + Ar) — a(t) 
ART AT (At = 0) 


is a vector collinear to the vector Aa. 
Let the ratio да have a limit as At > 0. Then this limit is called the 


derivative of the vector function a(/) with respect to the scalar argument 


амд 


t at the point /, customarily denoted by or by a'(7). Hence 





da(t) _ im 48 jim 2€ + 4D — al 
d 


ar-o At — Ar 0 AC 


The vector function a(f) having a derivative at a point ¢ is called 
differentiable at t. 

Now we shall investigate how the vector a is directed. The point Mı 
moves along the hodograph L towards the point M as At > 0 so that the 
secant MM, tends to the tangent to L at M. Hence, the derivative аа 15 
а vector tangent to the hodograph of а(/) at M and the direction of a 


coincides with that of the motion of the terminal point of the vector a(t) 
caused by the increase of the parameter f. 


The derivative ^ can be expressed in terms of the coordinates of а(ї). 
Let a(t) = e(0)i + (0j + y(Dk. Then 

Aa(t) = a(t + AD) — a(t) = iAe(n + jAV(0 + КАУ(0. 
Dividing both sides by At = 0, we obtain 


Aa(t) _ į; Ae Ay() Ay) © 
А a Ед EA 
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If the functions (£), V(f) and y(t) have derivatives at a given point 
t then every summand on the right has a limit as Af > 0 so that there exists 
a limit of the left-hand side of the above expression, i.e., there exists the 


derivative a . Evaluating the limit as At > 0, we get 





da _ ide аф dy 
dt Nap ae (+) 


Hence, relative to a stationary system of coordinates the derivative йа of the 


vector a(/) is given by the formula (8) so that in order to compute the deriva- 
tive of the vector function а(1) we have to compute the derivatives of the 
coordinates of a(t). 

Let r = r(/) be a position vector of a point moving in space. Then the 
velocity of this point at the moment / is determined by the derivative of 
r(/) at t: 

dr(t) 
dt 


Example. Find the derivative of the vector function 
a(t) = iR cos t + jR sin t + kht (А, h are constant). 





= У(0, 


за Applying (*), we get 


е -iR sin г + jR cos г + kh. > 


Differentiation laws. (1) If c is a constant vector then - -0. 
(2) If a(t) and b(?) have derivatives at ¢ then 


4 a) Ху ам " %0 


(3) i a vector function is ME by a scalar its derivative is multi- 
plied by this scalar so that 


d(oa(t)) 
dt 








=a at) (a is a constant). 


(4) The derivative of the scalar product of vectors is given by 


d _ | аа db 
di (a(t), Ь(0) = (ж ; ») + (a. A) 


Corollary. If а(/) is a unit vector, i.e., le(t)| = 1, then ^ is perpendicu- 
lar to e. 
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+4 Indeed, if e is a unit vector then (e, e) = 1. On differentiating (e, e) = 1, 
we have 


de de\ _ 
(5) 5 (6а) = 
de Е 
2 (Fe) =0. 


Whence we infer that а is perpendicular to е. >» 


or 


(5) The derivative of the vector product of the vectors is given by 


d da db 
at [a (7), b(^)] = E , » + E | . 


Exercises 


Compute the derivatives of the following functions 


1. у-х —-5х+1,2.у=2ух—-— + 5. 


xi- 





24 


3. у = (х2 — 3х +30? — 1). 4 у = «x « n. - ).s»- T 
xl 


3 





1 
=œ «»(s- 1). "0 =? 7. у= : 
( ) xi ›'@) y EF 
Ae gine 29, у= X... ду y= sin? x. 
8. у = sin x — cos x y ТЕ сб y= sinf х 
11. = tan? х — tan x + х. 12. y = sin 5х. 13. y = 2 sin(3x — 1). 


14. у = sin l/x. 15. у = sin? 2x. 16. у = sin (sin x). 17. y = x sin! x. 
18. y =x sin x tan ^! x. 19. у = sint., 20. y = tan ! x’. 


In x 


т+х? 24. y = In tan x. 
x 


21. y = In? x. 22. у = х? loga x. 23. у = 


X 
25. у= 1. 26. y = 9 27. y = хе 28. yet. 


29. y = x? — 3". 30. y = 105791, 31. y = 595, 32. y = sin (3*). 


26—9505 
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33. y = sinh? x. 34. у = Vcosh x. 35. у = tanh (In x). 36. y = 35in&^» 
31. y = (sin x)". 38. y = sing, 39. y = x". 


_ оо 2 
40. у= &— 2) Nx *1 41. у= 2х? - а? - nc Vx? - ab. 





(x — 5) 
1 x 1 cosx 
42. у= =] 2-2 Я 
y э In tan 2 sin? x 


43. y = 3х? sin ! x + (х? + 2) V1 - x. 
44. у = x (sin ! x)? - 2x + 2V1 – x^ sin ^! x. 
Compute the first differentials of the functions 
1 
4x4 
48. Compute the approximate value of tan~' 1.02. 
Compute the derivatives of higher orders for the functions 
49. y -x 3x + 5, y" = ? 50. y = tan ! x, y"(1) = ? 
51. y = a", у” =? 52. у = х? lnx, y" =? 


= gin sin x 


45. y — . 46. y = tan? х. 47. y 


Write down the nth derivatives of the following functions 
1 

x +x! 

Find the differentials of higher orders for the functions 


56.y—-e 72 Фу = ? 57. y = х", d'y =? 


53. y = хе". 54. y = sin? x. 55. y = 


Compute the derivatives 2 of the functions given parametrically 


EV UN = 3 20352 
58. 2 20-11 sg, A a cos 1.68, {5 P 
=. = b sin’ t. me. 


2 
Compute the second derivatives a of the functions given parame- 
x 
trically 
= E 
61. * = In 5 62. {х= 2 cos t, 
=f. =e sin f. 
Draw the graphs of the following functions 


2 2 3 

3 x- 1 x^ +1 х – 4 

„‚у=х- х. 64. у = . 65. у= ——__.. 66. y= : 
63. y = x — x . 64. y x y x 66. y z 








get D 
2x +1)` 


x—1 


67. y = xe *. 68. у = 69. y=2In 4 1. 
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-8 — x? 
deca Tes Vx(x - 19. 72. y = Мх - 2). 


73. y = Vx - D - Yi - 2y. 


Compute the maximum and minimum for each of the following func- 
tions defined on closed intervals 


70. y= 


Час: us qut for хє, 4]. 
х 


2 


15. у = У2х х - 6) for xe [-2, 4]. 


16. y = 10x + 10 


-—————— for xe[-l1, 2]. 
х? + 2х + 2 1 


71. Expand the polynomial x? + 3x? — 2x + 4 in powers of x + 1. 

78. Expand the polynomial x? — 2x? + 3x + 5 in powers of x — 2. 

79. Use Taylor's formula to expand the function f(x) = 1/x in a neighbour- 

hood of the point хо = —1. 

80. Use Maclaurin's formula to expand the function f(x) = x e* in a neigh- 

bourhood of the point хо = 0. 

81. Use Maclaurin’s formula to find the expansion of the function 

F(x) = e*?*? with the remainder o(x^). 

82. The shape of a string hanging of its own weight alone is given by the 

equation of the catenary y — a cosh x/a where a is a constant. Show that 

given sufficiently small |x! the string takes the shape of a parabola. 
Use the derivatives of higher orders to investigate the shape for each 

of the following functions in a neighbourhood of a given point xo. 





83. у = sin?(x — 1) — x? + 2x, Xo = 1. 
84. y = cos x + cosh x, Xo = 0. 
85. y = x? + 2 In(x + 2), Xo = —1. 
86. у -x!-2e* |, хо = 1. 
Answers 
" , 1 1 , 4 3 2 
1 у= 2x - 5. 2. у = — + —.3. у’ = S5x* — 12x" + 9x* – 2х + 3. 
Ух x? 
ES 
Mec ЛЕ ee ey eue Gg? Ed 
2vx x QÊ + 1È 3X1 + x2) 
8. у’ = cosx + sin x. 9. у’ = . 10. y’ = sin 2x. 11. у’ = tan^ x. 
, 1 + cos х 
1 1 
12. y’ = 5 сэѕ 5х. 13. у’ = 6 cos (3х – 1). M. у’ = - 5 cos —. 
x x 


26* 
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"X x 
15. у' = 10 sinf 2x cos 2x. 16. у’ = cos (sin x) cos x. 17. y’ = sin y I 
1-х 
18 а ae vote ea 19. y = - 2 
! 1+ x? Ixl Vx? — 4 
20. y' = 2 . 21. у' — 21пх.22.у' = 2х1овх+-^—. 
1 + x* x In3 
— 2x? 2 1 
23. y! = 14x? ~ 2х ша 24. y! = ——-.25. y’ = - —_— . 26. y'= 9 In 9. 
x (1 + xy sin 2x x In? x 
21 2-1 2х7 
27. у' = ex + 1). 28. y = ©! = Dt2 оф у, = 3x? — 38 In 3. 
x 
30. у’ = 3:102 *! In 10. 31. у’ = 5%"* cos x In 5. 32. у’ = 3" cos(3x) In 3. 
h 1 
33. y' = 3 sinh? x cosh x. 34. y' = шинж, ЗБ), 
2Vcosh x x cosh? (In x) 
2 
36. y' - 39995 Sinh 2x In 3. 37. y' = (sin x)" = — sin x In (sin 9) 
sin x 
38. у' = yim (co (In x) + Fn 39. у’ = 2x er Min oe 
x 
40. y' = 2x - 20 + Их + D à - а?, 42. у' 
3(x — 5)* Мес + + т? sin? x 
dx 2tan x 
43. y' = 9x? sin ^! x. 44. y' = (sin^! х)?. 45. dy = – — 46. dy = dx. 
x cos? x 
" 1 
47. dy = 5" 5^ * cot x In 5 dx. 48. tan`? 1.02 = 0.795. 49, y" = 6х. 50. y"(1) = - 2 
51. у” = 27a" In! a. 52. у” = x'(60 In x + 47). 53. у = e'(x + n). 
1 1 
54. y = 2"-1 зіп |2x «(n - I=]. 55. y = (- "n! -———— 
> ( 5 У ( ) х"! (1 + х)" +! 
Hint. Express the function in the form y = -— l SS acm din md : : 
x (1 +x) x (1 + x) x +x 
2 ~ 7/272 2 3 m-3 dy 3,5 
56. d'y =e (x^ — Idx’. 57. Фу = m(m – 1)(т - Dx" dx*. SE =t. 
x 
- -4 
s9. Z- Баас 60. Z = op 61. TY шор 6з, P 25 f 
dx a dx dx? dy! (cost — sin t 


63. Fig. 8.68. 64. Fig. 8.69. 65. Fig. 8.70. 66. Fig. 8.71. 67. Fig. 8.72. 68. Fig. 8.73. 

69. Fig. 8.74. 70. Fig. 8.75. 71. Fig. 8.76. 72. Fig. 8.77. 73. Fig. 8.78. 74. М = 1, m = -1. 
75. M = 0, m = -4. 76. М = 5, т = 0. TI. (x + 1)? — 5x + 1) + 8. 

78. (x 2)? + 4(х – 2)? + 7(х – 2) + 1. 79. (х) = -1 - x +1) - (x 1)... – 





n x? 3 
бо ey OD бшер вулду=<х+^-+^+...+ 
[71 + 6(x + D]^*! | 1! 2! 
CEU a aea луг” oed < 1. 81. fO = Y E? КЕ + 002), 83. Fig. 8.79 
c сын" f .81. 2 xm . 83. Fig. 8.79. 


84. Fig. 8.80. 85. Fig. 8.81. 86. Fig. 8.82. 
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Chapter 9 


Integral Calculus. 
The Indefinite Integral 


9.1 Basic Concepts and Definitions 


Antiderivative. Differential calculus provides methods of computing 
the derivative f'(x) of a given function f(x). Now we shall examine ap- 
proaches and methods of solving the problem inverse to that of differentiat- 
ing functions; namely, we shall study the problem of finding the function 
Ах) provided that its derivative f’ (x) is given. The branch of mathematics 
that deals with methods and techniques of solving the latter problem is 
called integral calculus. 

We say that the function F(x) is the antiderivative (or primitive) of the 
function f(x) on the finite or infinite open interval (a, b) if F(x) is differen- 
tiable at every point of (a, b) and F'(x)- f(x) ог, equivalently, 
dF(x) = f(x) dx for all x € (a, b). 

Examples. (1) The function F(x) = sin ^ !x is the antiderivative of the 


function f(x) = on the interval ( — 1, 1) since for all x € (—1, 1) 


F’ (x) = (sin (х) = фэн . 
-x 


(2) The function F(x) = ru. ‚ O < a = 1, is the antiderivative of the 


function f(x) = а" on the interval (— oo, + oo) since for all хє (— oo, + оо) 
nao GC \ _ аша NM 
мон (12) шаг 2 
If F(x) is an antiderivative of the function f(x) on the interval (a, 9) 
so is the function (x) = F(x) + C where C is an arbitrary constant. 
Indeed, $'(x) = [F(x) + C]' = F'(x) = f(x) for all хє (a, b). There- 
fore if the function /(х) has an antiderivative on (a, Б) then f(x) has in- 
finitely many antiderivatives on this interval. 
The following theorem establishes the relationship between any two dis- 
tinct antiderivatives of a given function. 
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Theorem 9.1. Jf F(x) and Ф(х) are any arbitrary antiderivatives of a 
given function f(x) on the interval (a, b) then the difference of F(x) and 
(x) is equal to some constant so that Ф(х) — F(x) = С, хє (а, b) (C is 
a constant). 
< Since F(x) and Ф(х) are antiderivatives of f(x) on (a, b) then F’ (x) = 
f(x) and Ф'(х)=/(х) for all хє(а, Б). Оп differentiating 
ф(х) = Ф(х) — F(x) we obtain e'(x) = ©’) - F'(x) = f(x) - Дх) = 0 
for all x € (a, b). Let xo and x be two distinct points in (a, b). Then applying 
the mean value theorem (Theorem 8) to the function ф(х) on the closed 
interval [xo, x], we get 


e(x) — ф(х) = (x — xo)e' (8), 


where xo < £ < x. 

Since y’(x)=0 on (а, b) then e9'(£) = 0 and, consequently, 
ф(х) = (хо) for all x є (a, b). Hence, the function (х) is constant оп (a, b) 
and (x) — F(x) = C, where C is a constant for all xe (a, b). > 

Corollary. If F(x) is any antiderivative of the function f(x) on the inter- 
val (a, b) then any other antiderivative Ф(х) of f(x) takes the form 
Ф(х) = F(x) + C where C is a constant. 

The indefinite integral. The set of all antiderivatives of a function f(x) 
on (a, b) is called the indefinite integral of f(x) on (a, b), written | f(x) dx. 
The symbol | is called the integral sign, the expression f(x) dx is called 
the element of integration, the function f(x) is called the integrand and 
the variable x is called the variable of integration. 

If F(x) is any antiderivative of the function f(x) on the interval (a, b) 
then by virtue of the above corollary we have 


| I(x) dx = F(x) + С, 


where C is an arbitrary constant. 

Observe that any identity involving indefinite integrals in the left-hand 
and right-hand sides establishes the equality between sets of antiderivatives. 
This equality means that the sets contain the same elements, i.e., the same 
antiderivatives. Е 

Sometimes the expression | f(x) ах is thought of as any element of the 
set of antiderivatives, i.e., any antiderivatives of the function f(x). 

We shall prove elsewhere the following theorem on the existence of the 
indefinite integral of a given function. 

Theorem 9.2. Let f(x) be continuous on the interval (a, b). Then the 
function f(x) possesses an antiderivative and, consequently, an indefinite 
integral on (a, b). 

The operation of computing the antiderivative or the indefinite integral 
of a function f(x) is called the integration of f(x). Clearly, this operation 
is the inverse of differentiation. 
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Properties of the indefinite integral. In what follows we assume that 
all functions involved are continuous on the interval (а, b); hence, we as- 
sume that the functions possess indefinite integrals on (a, b). 

(1) The element of integration is equal to the differential of the in- 
definite integral so that 


da(| Дх) ах) = f(x) dx. 
ча Indeed, since F'(x) = f(x) for all хє (а, b) then 
а(\ fx) dx) = d[F(x) + C] = dF(x) = F'(x) dx = f(x) dx. » 
(2) The integrand is equal to the derivative of the indefinite integral: 
([/о)ах) = foo. 


ча This readily follows from property (1). > 
(3) The indefinite integral of the differential of some function differs 
from this function by an arbitrary constant: 


[dFG) = F(x) + C. 
«4 Indeed, if F'(x) = f(x) for all хє (a, b) then 
\4Р(х) = [F œ dx = Uo dx = F(x) + С. >» 
(4) If the integrand is multiplied by a constant factor then the indefinite 
integral is multiplied by this factor: 
JAF) dx = А(Дх) dx (А = const, А = 0). 
“4 Indeed, by virtue of property (2) we have 
(АА) ах) = АЛ) and (Al f(x) dx) = A(N dx) = А/О) 
so that | Af) dx and A\ f(x) dx correspond to the same set of antideriva- 


tives of the function Af(x). > 
(5) The indefinite integral of a sum (difference) of two functions is equal 
to the sum (difference) of the indefinite integrals of these functions: 


1100 + e(x)] dx = Uo dx + [eoo dx. 
за By virtue of property (2) we have 
(SL + еб] dx)' = Л) + eco. 
On the other hand we have 
(роо ах + (еб) а) = (w ах) + (fe) ах) = f(x) + ф(х). 


Therefore both fi ТО) + Ф00| ах and Vo dx + [eoo dx are an- 


tiderivatives of the function f(x) + ф(х). Hence, they differ from each other 
by some constant С. > 
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Corollary. 
| | 3 A00] dx = Y A од dx, 
k=1 k=1 


where Ay = const (А = 1, 2, ..., n). 
The expression of the form 


Y Axfx(x) = Aifi(x) + Ар(х) +... + Ад/ 00, 
k=l 


where all Ак are some constants, is called the linear combination of the 
functions fx(x), k = 1, 2, ..., n. Therefore we can formulate this corollary 
in the following way: the indefinite integral of a linear combination of any 
finite number of functions is equal to the linear combination of the in- 
definite integrals of these functions. 

Properties (4) and (5) are sometimes called the linear properties of an 
indefinite integral. 


Some indefinite integrals involving elementary functions. Every formula 
for a derivative of a given function, i.e., a formula of the form F’ (x) = f(x) 
can be represented in terms of the function f(x) and its antiderivative F(x) 

$ | f(x) dx = F(x) + С where С is an arbitrary constant. This enables 
us to derive the integration formulas directly implied by the differentiation 
formulas for basic elementary functions. These are: 


yet 
|а = ———-—— +C аж —1,х > 0. 
а +1 


| Eur In|x| +C, x z 0. 
x 
(еа = та + С 0<а #1. 


(In particular, putting а = e gives je'dx = е* +C) 


sin xdx = —cos x 4 C. 


4 = -cotx+ G x & nz (n 2 0, +1, +2,...). 
sin X 





= tanx + C, x + пт (п= 0, +1, +2, ...). 
со 





| сохах = = sin x + C 
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(< ах == іп іх+С -1< |х| <1. 
1-х 
| Eu =n + с |x| «a. 
а – х 
ах 2 2 (co ” 1 
———— =ш|х+ух + а? | +С (the “minus” sign re- 
Vx? + а? 


quires the condition |x| > |а| be fulfilled). 


| E -tan !x 4 C 
x 


tan ! = +С a #0. 


&j- 


a + х2. 


sinh хах = cosh x + C. 


= = sinhx + С 


А Dro = tanh x + C. 
“Cos 

| _ = —cothx + C x #0. 
sinh^x 


These formulas are easily verified by differentiating their right-hand 
sides whose derivatives are equal to the respective integrands. 

The distinction between the operations of differentiation and integra- 
tion is that on differentiating elementary functions we always obtain 
elementary functions while subjecting some elementary functions to in- 
tegration can sometimes lead to nonelementary functions, i.e, there exist 
elementary functions which are nonintegrable in elementary functions. For 
example, it has been established that the following integrals cannot be 
represented in terms of elementary functions although, due to continuity 
of integrands, these indefinite integrals exist in the domains of definition: 


| e ах, | sinx dx, x #0, 





| sin x^dx, | — dx, x £0, 


| cos x^ dx, | ax 0O<x# 1. 
Inx 


414 9. Integral Calculus. The Indefinite Integral 





Some indefinite integrals can be reduced, by suitable manipulations on 
their integrands, to integrals easily computed by applying the integration 
formulas that we have derived for basic elementary functions. To illustrate 
this method of integration we turn our attention to the examples of comput- 
ing each of the following indefinite integrals. 


2 
Examples. (1) 106 - Js) dx. 


ча Manipulating on the integrand we get 


2 
| (v8-~5) dx = \(#-24 Jy) ae = [e-2 + х?) ах 
4 


3 3 х x C X 1 
= — E m. — —--- = == шв- -- P > 
E dx 2 [ae |: dx 4 2х+ 23 + 4 2x а 
(2) | @ Хх. de 
хх 
ча We have 
2 2 
Apron dec | lt2x tX бх | Ч + х?) + 2x dx 
X tx x(l + x^) х(1 + x?) 
= (15-22 dis os Жон = Іа |х| -2tan ! x« C. 
х 1-Х х l+x 


dx. 





o | DX EXD 


-« We have 


(эхэ ax =2\ (3) ax +3| (3) ax 


(3/5) (2/5) НЕ 
БААУ 








9.) Methods of Integration 


Integration by substitution. We wish to compute the indefinite in- 
tegral (//(х) dx of the continuous function f(x). Suppose that there exists 
a function x = (t) having a continuous derivative o'(f) and the inverse 
function £ = y(x). Then we may write 


Oe) dx = [Fle] p(t) at. 


Clearly, substituting £ = y(x) into the right-hand side of the above identity 
we shall get the integral in terms of the original variable again. 
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-4 To prove the above identity we differentiate the left- and right-hand in- 
tegrals with respect to x. On differentiating the left-hand integral, we get 


(Лоо) ах), = Лоо). 
Observe that / = y(x) is the inverse function of x so that 
1 1 


ty = =- = +H , 
"ox g(t) 
where e'(f) = 0. 
Then applying to the right-hand integral the chain rule of differentiating 
a composite function, we obtain 


(| AO e'() d?), = (| Лев” (dr), - t 


= flee! (0 --1—. = fiet) = Л). 
220) 

Since the left- and right-hand integrals have the same derivative these 
integrals define the same set of antiderivatives of the function f(x). Whence 
it follows that the desired identity is true. > 

The indentity being considered lays down the method of integration 
called integration by substitution. When using this method we have to 
choose a suitable function (t) to simplify the computation of the original 
integral. 


Examples. Compute the following integrals. 


dx 
(D | pf (а> 0. 
Vi? + а? 


ча Putting x = a sinh f, we get dx = a cosh t dt. Then 





| dx -| acoshtdt _ |t T [а= 
Vx? +a? N a? (sinh? 1-1) a cosht 


To express the result in terms of x we solve the equation x = asinht 
t -t t —t 
е-е е-е 
а ss 


to get ¢ as a function of x. Since sinh! = 2 А 7 





or ае?! — 2xe' — а = 0. Whence we get 
ro XENI + а? 
2 : 
1 х + Мх?+а? 
Since е' > 0 we have e‘ = е REN and t = lIn (x + Nx? d? )- 


Ina. Finally, we obtain 
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| Linge М2 жа) +С 
мх? + а? 
where С = C – Ina. > 


о | —_@ 
(x + 2)\Vx + 1 


ч Put x = 1? — 1 so that dx = 2tdt and t = Vx + 1. Then 





-2 dis. = 2їап- +C 
t 


| ‚ах = | ха _ 
Cone УШЕШ +i 


Substituting / = Ух + 1 into the right-hand side, we get 


| ООХ. ыу 2tan jx «1 +С >» 
(x + 2)vx + 1 


Remark. Suppose that in the integral 1463) dx the element of integra- 
tion f(x) dx admits a representation of the form 
Јо) dx = g [W] v' (x) dx 
so that 
f(x) dx = g [69] d[V(x)]. 
Assume that the function g(t) is easily integrable, ie, the integral 
[gc dt = F(t)+ C 
is easily computed. Then substituting / = (x) into the latter integral, we 
obtain. 


yo dx = F [V(x)] + С. 
(3) | 2 = 225 dx. 








20227 
а Put /—-2* +27" (»0) so that dt = (21n2—2 *In2) dx and 
х 5-x E dt 
(2-2 )dx = qm» . Then 
25—275 4. | dí _ 1 . In (27427?) » 
| i27 dx- ind | 5 1н2? В+ C- -- S ke tC. 
2x 
dx 
(4) | PAP. M 
Vert 


en cef 5 хах 
ч Putting Мех + 1 = f, we obtain - ZEN QE 
| Ме* + 1 


= 2dt and е = г? ~ 1. 


Then 
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| e"dx —— | x e'dx 


= ber a eS je – p at 
Ме + 1 Ме + 1 


3 
= (5 - ) i сг (+12 -246-1-0 


3 


= (= 2) Ver +1 -с» 


Integration by parts. Let the functions u = u(x) and v = v(x) have con- 
tinuous derivatives и (х) and v'(x). Then applying the product rule of 
differentiation, we have 


[u(x) 50017»: u(x) v'G) + vG)u' (x) 
so that the product u(x) v(x) of the given functions is an antiderivative 
of the sum u(x) v’ (x) + v(x) u' (x). Hence 
шо) v'(x) + v(x)u'(x)] dx = u(x) v(x) + C. 
By virtue of the linear property of indefinite integrals this expression 
can be written as 
[400 v'(x) dx = u(x) v(x) – jer) u'(x)dx + C 
or 
{и dv = uv – {о du +С 
since v'(x) dx = dv and u' (x) dx = du by the definition of the differential. 
Noting that the constant C can be combined with either of the integrals 
involved we may write the above identity as 
fu dv = uv — |v du. 
This identity lays down the method of integration called integration by 
parts. The idea is to replace the original integral fu dv by the integral [v du 


easily computed in some specific instances. The method involves splitting 
the original element of integration into two factors и and dv = v’ dx and 
subsequent differentiation of the former and integration of the latter. Clear- 
ly, integration by parts is helpful and constructive when this splitting pro- 
vides easily differentiable и and easily integrable dv. 

To illustrate integration by parts we shall work through the integral 

(2 — 3x) cos x dx. 

ч Here udv = (2 — 3x) cosxdx. Put и = 2 — 3x and dv = cosx dx so 
that du = —3dx and v = [cos x dx = sinx. Then 


fu dv = иу – {о ди = (0. - 3x)cos x dx 


= (2 — 3x) sinx + 3|sin x dx -(2—3x)sinx -3cosx + С > 


2717—9505 
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Notice that if we put и = соѕх and ар = (2 – 3х) ах or 
и = (2 — Зх) соѕ x and dv = ах the integral [v du becomes more complicat- 
ed than the original one. 

On integrating dv to get the function v we may choose any suitable 
value of the constant of integration C since C is not involved in the final 
result (to verify this it suffices to substitute v + C for v in the identity 
describing integration by parts). In particular, it is convenient to put C — 0. 


Examples. (1) We wish to compute the integral |In x dx. 
а Noting that udv = Іп хах we have the unique factorization и = Inx 


and dv = dx. Then du = x and v= (ах = x; whence we obtain 
[ахах = xInx — [dx = xInx - x + С = х(ах- 1) + С. > 


(2) Integrate by parts IV a - х? dx (|x| «141» 
A Put и=^/а* — x? and dv= dx so that du = —- EU. апа 


v = x. Integrating by parts, we get 
2 
[vaa ах = хма? -x + | шал бк. олы, 


On manipulating with the integrand of the right-hand integral, we 
obtain 


2 2 
[ve -x!dxexNa^ -x + | шоог ) dx +2C 


а? – х 
_ 2 2 2 ax | 2 2 
=x а-х +a Se sek a ч= Va —x dx + 2C 
| 
5 [м — dx + 2C. 


We have obtained the equation in one unknown [М a? — x* dx. Solving 
this equation, we get 


2 2 
| vg ug de=} ха —› ж-ш t4 C. 


a 


=x Ма? — x? + a? sin 


Problem. Verify the following formulas 


2 

(a) | Vx xa ах =! x Nx +a + S Ine vx + a?) + С; 
2 

€ [Ма ах= хм х? – а? - In| xt Vx? = а? | + С, 


NI 


N| = 


where |x| > lal. 
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Remark. Sometimes we can use integration by parts to compute the 
right-hand integral |v du. To illustrate this approach we shall work through 
the following examples of integrating by parts. 

Examples. (1) We wish to compute the integral (x? 2 dx. — 

4 Put u = х? and dv = 2" dx so that du = 2xdx and v = 

Integration by parts gives 


12° 














зак ог ДР A 20 х 
E 2*dx =x na quo" (х2 ах. 
The right-hand integral can also be integrated by parts. If we put u — x 
x 
and dv = 2'dx then du = dx and v = 25 so that 
Dept Жыз oo А 2 2* 1 x 
[ева =x In2 In2 (s In 2 In 2 (2 ax) 
ee ANE 2* 2* 
~*~ n2 12 (« Ind 288) НЫ 
2 2 2 2" 
= — 23 » 
(х In2 ** 027) 13 TO 


(2) Compute the integral | e™cos Вх (a = 0, 8 #0). 
~ Applying integration by parts, we put u = е“ and dv = cos Gx dx (or 
и = cos Вх and dv = е" dx) so that 
sin Вх. 
E 


du = ae™ dx and v = [os Bx dx = 


Then 


1 3 a : 
[es Bxdx =— e“sinBx —— | еіп Bx dx. 


B 8 


Integrating the latter integral by parts gives: и = e and dv = sin Bx dx 
cos Bx 


B 


|” sin Bx dx = — l е cos Вх + 2 fer cos Bx dx. 


8 8 


Substituting this identity into the preceding one, we obtain 


so that du = ае°ах and v = and 


2 
[e cos Bx dx = à e?* sin Bur ех cos Bx je cos Bx dx. 


Thus applying integration by parts twice we have arrived at the equation 
in one unknown, whence we have 





2 ах 
(: + 82) jen cos Bx dx = r (о cos Bx + 8 sin Bx) 
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and 





ах 
Ie cos Bx dx = 22 (а cos 8х + Bsin Вх) + С. 
а 


m 
Analogously, we can compute the integral 


[e sin Bx dx = NH (o sin 8х — 8cos Вх) + C. * 
a’ + В 


Integration by parts is helpful to compute a number of integrals given 
below. 


n 
Integrals of the form [P469 In x dx, where Р(х) = У) сүх“ is the nth 
k=0 
degree polynomial. If we put 4 = In x and dv = P,(x) dx then du = oe 


and v = \Р,(х) dx = О, + 1(х), where Оп + 1(х) is a polynomial of degree 
(n + 1), and integration by parts gives 


Е = Ол +1(х)1ах — Qum dx 
= Qs. iG lnx — Нь. 0) + C, 
where Нһ+1(х) is a polynomial of degree (n + 1). 
Example. Compute the integral (ux + 2x) In x dx. 

4 Putu = In хапа dv = (4x? + 2x) dx so that du = © and v= fax 4 
2x) dx = х* + x?. Then integration by parts gives 

[ex + 2x) Inx dx = Qf + x?)Inx — [o + х) dx 

= (4 + eying — X o +С» 


Integrals of the form [Pn (x) tan ^ ! ax dx and [Pa(x)cot 71 ax dx, where a 
is a real number. These integrals are reduced to those of rational functions. 
For example, if we put и = tan! ах and dv = P,(x) dx in the former in- 


a dx 


tegral then du = quu and v = Оһ+1(х) and integration by parts 
ax 


gives 
P(x) tan~! ax dx = Qn+1(x) tan”! ax а Олз). dx. 
1 + ах 


Analogously, we work through the latter integral. 
Example. Compute the integral [ax + Dtan ^ !x dx. 
dx 


+X 


and 





4 Put и = tan !x and dv = (3x? + 1) dx so that du = j 
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v = x? + х. Then integration by parts gives 


3 
(С + Dtan ! хах = (x? + х)їап !x - | + ах 
х 


1 


› 
= (х? + x)jtan !x- |а = (х5 + xtan ix- © + С. >» 


2 


Integrals of the form [P460 sin ^! ax dx and | Р(х) ċos ax dx, where 
a is a real number. To integrate these integrals by parts we put и = sin ! ax 


(и = со8 "ах in the latter integral) and dv = Pn(x) dx. Then 


du = 2 (au = - a dx ) $0 


МІ + ox V1 - ох? 





and 
| Pato зіп! ax dx = Qs. 1(х) sin"! ax — a | a 
1 — ах 


The integral in the right-hand side сап be computed by using different 
methods which will be discussed in detail in Sec. 9.4. 
Example. Integrate by parts [xsin ~} x dx. 








iic 2 dx -. Gy 
ч Putu —sin ‘x and dv = х dx. Then du = and v 3 
1-х 
Integrating by parts, we get 
3 3 
xn t xdx = X- ад ed | x4 
3 3 [ux 


Ву the substitution = V1 — x? so that x? = 1 — t? and x dx = -tdt 
the integral on the right is reduced to 


х! dx х? [t 
== | mc d 
- |ё-0й-10-08С--406-2У1-3 + C. 








Finally we have 


3 
Б = > sin” x4 (х2 -2N1-x.- 
Integrals of the form [Pa e™ dx where А is a real number. Putting 


и = P,(x) and dv = e" dx we obtain du = Р;(х) dx and v = l e ( з 0) 
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so that 


| Pate e“ dx = 1 Р(х) е – (рхо) e" dx, 


1 
Х 
where Р(х) is a polynomial of degree (п — 1). 

Then applying integration by parts to the right-hand integral we again 
obtain the expression which can also be subjected to integration by parts. 
Continuing this process п times we finally arrive at the integral 


ЕСЕК С: 


Example. Compute the integral fo? + 2x)e'dx. 
ч Put u = x? + 2x and dv = edx So that du = 2(x + 1) dx and v = e. 
Then integration by parts gives 


fo + 2x)e'dx = (х? + 2x) e — 2|(х + 1) ах. 
To integrate the right-hand integral by parts we put и = x + 1 and 
dv = e'dx. Then du = dx and v = e' so that 
[x + 1)е*ах = (x + De'- [edx = (x + De'-e + C = xe + С. 
Finally, we have 
а + 2х) edx = (x? + 2х) е" — 2xe + С = хе! + 0C» 


Integrals of this form can also be computed by the method of compar- 
ing (unknown) coefficients. In this case we assume that the integral is equal 
to the product of the nth degree polynomial 


ООо) = bo + bix +... + Бах" 
with unknown coefficients bo, bi, . . ., bn by the function е“, i.e., we assume 
that 

[P460 e"dx = Q,(x) е“. 


On differentiating the integral we get 
Р,(х)е" = On(x)e™ + XOn(x) е“ 
whence, dividing by e" з 0, we get 
Р„(х) = Ол(х) + А О„(х). 


Since the polynomial on the left is to be identically equal to the sum 
of polynomials on the right, the coefficients of any power of x must be 
equal; so comparing coefficients, we obtain the system of (n + 1) linear 
equations in п unknowns bx. This system has a unique solution, for the 
respective determinant is distinct from zero. To demonstrate how this 
method works we shall compute јо? + 2x) e'dx. 
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а Put 
[GC + 2x) e'dx = (bo + bix + Бох?) e", 
where bo, bi, b; are unknown coefficients. 
On differentiating this identity, we get 
(x? + 2x) e = (bi + 2b;x) е" + (bo + bix + box”) е". 
Whence, division by e" = 0 gives 


2x 4 X^ = bo + bix + box + bi + 2box 
or 
2x + x? = (bo + bi) + (bi + 2b) x + box’. 
Comparing the coefficients of equal powers of x, we arrive at the system 
of linear equations 


x? 0 = bo + bi 
x" 2= +2b }, 
х? 1 = 
whose solution is bo = 0, b; = 0 and b = I. 
Hence the original integral becomes 
fo + 2х)е ах = x7 + C. e 
Integrals of the form [5.09 sin Bx dx and [P460 cos Вх dx, where 8 


is a real number, 8 z 0. 
ч Putting u = P,(x) and dv = sin 8x dx (dv = cos Вх dx in the latter in- 


tegral), we have du = Р (х) dx and v = шин ын (v = sing), Then 





2 sin Bx dx = -Р,(х) P Bx + 2 | Pics) cos Bx dx. 


Applying integration by parts л times, we finally obtain 
[sin Bx de = -2ЭР- and [eos Bx dx = ARE. » 


Example. Compute the integral (e — 1) cos x dx. 


ч Put u = x? — 1 and dv = соѕ хах so that du = 2x dx and v = sin x. 
Then integration by parts gives 


fo - Dcosxdx = (х? — 1)sinx – 2(x sin x dx. 


Naw we put u = x and dv = sin хах so that du = dx and v = —cos x. 
Then we have 
e — 1)cosxdx = (x? — 1)sinx + 2xcosx — 2sinx + C 
= (х? — 3) ѕіпх + 2xcosx + С > 
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Integration by parts can also be helpful in dealing with some other in- 
tegrals apart from those considered above. By way of illustration we shall 











compute the integral | хах Я 
snx dx 

4 Putting и = x and dv = —5—, we obtain du = dx and v= 

dx sin“ x 
| —,— = —cotx so that 

sin“ x 

| ae = — xcotx + [oot xax 
sin^ x 


By the substitution / = sin x and dt = cos x dx the integral on the right 
is reduced to 


[cot xax = e = p = а || + C2 In|sinx| +С 
sin х t 


Finally, we have 


onde = -xcotx + In|sinx| + C » 
sin^ x 


9.3 Integrating Rational Functions 


Rational functions. The simplest rational function is the nth poly- 
nomial 


О„(х) = aox” + aix" +... + ds aX + аһ, 


where the coefficients ao, di, ..., аһ are real numbers and до # 0. 

The polynomial Q,(x) is said to be monic if ao = 1. 

The real number b is called the root of the polynomial Q,(x) if 
Q,(b) = 0. 

It is known from algebra that any real polynomial О,(х) can be uniquely 
factorized into a product of monic linear polynomial x — b and monic 
quadratic polynomials x? + px + q, where p and q are real coefficients and 
every monic quadratic polynomial is irreducible to a product of linear poly- 
nomials, for it has no real roots. Then gathering equal factors if any, we 
can write the monic polynomial Q,(x) in the form 


On(x) = (x - a(x— b). . . œ- DG? + pix gi. . . OF  psx qs), 


where the exponents a, 8, ..., №, gi, ..., ps аге natural numbers. 
It is easy to notice that for the nth degree monic polynomial Q,(x) the 
following condition is satisfied 


at+tB+...+4+ 0ш + pot... tps) = п. 
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When а = 1 the root a is said to be unrepeated or simple; when a > 2 

we say that a is a repeated root of Q, and call the number o the multiplicity 
of a. 
Pm (x) 
On(x) 
Pm(x) and О„(х) which have no factor іп common. 
Р„(х) 
On(x) 
tion if the numerator Р(х) has a degree lower than that of the denomina- 
tor О„(х), ie, if m « n. If m 2 n we can apply the division algorithm to 
express f(x) as 





A real rational function f(x) is the ratio of two real polynomials 


The rational function f(x) = is called the proper rational func- 








_ Pm(x) _ P(x) 
Оо AE : 
5 : P(x) 
where А, – „(х) and Р(х) аге real polynomials and 0.00) 


tional function. For example, if we apply the division algorithm to the ra- 


5 
tional function L +1 _ Р) 
x +1 Q»(x) 


ххх 
x + 1)х? + Ox4 + Ox? + Ox? + Ox + 1 
х? + Ox* + 1x? 
= Ix? + Ox? + Ox + 1 
— 1x? — 0x? – 1х 
х +1. 





is a proper ra- 





, we obtain 








5 
+1 — р Sob. 


so that R3(x) = x? — x, P(x) = 
241 x41 300) б) 








Непсе, Гы 
х 
xctl. : : 
Xx +1 and 211 is a proper rational function. 
x 


Splitting a rational function into partial fractions. Partial fractions are 
proper rational functions of the form 
A A Мх + N Мх +п 
, ka and —у-————у, 
х-а (x-a) х + px+q (x^ + px + 9) 





where A, M, N, a, p, and q are real numbers, k (k > 2) is a natural number 
and quadratic trinomial x? + px + q has no real root so that the condition 
2 2 
Z- -4<0 ог а- 5->0 holds. 
We turn once again to algebra and quote the following important fact. 
Р„(х) 
Theorem. 9.3. Let f(x) = —* 
function and let Qn(x) = (x — ay (x — b}. . (2 + psx + qs)". Then f(x) 





(m « n) be a proper real rational 
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is uniquely reducible to a sum of partial fractions of the form 


P(x) _ A А Aa 


+ p bet 
On(x) x—a (x — a) (x — a) 
Bi В, В 
+ — +- +... +r 
x-b (-by (x — Б)? 
Mix + № Mox + № M,x + №, 

+ —5 potet 58—80, 

X^ px + Gs (x^ + psx + Ф) (x^ + psx + qs)! 


where А}, 4», PED Aa, By, B, a a жу Вв, 238653 Mi, Ni, M2, №, — M, 
N,, are real numbers not all equal to zero. 

To define the coefficients in the numerators of the partial fractions we 
multiply the left- and right-hand sides of the preceding expression by О,(х) 
and apply the method of comparing (unknown) coefficients, i.e., we com- 
pare the coefficients of equal powers of x on the left and on the right thus 
obtaining a system of linear equations in the desired unknowns. The solu- 
tion of this system uniquely defines the coefficients we are seeking for. 

Sometimes it is convenient to use another method of computing coeffi- 
cients of partial fractions. If two polynomials are identical the identity 
holds for any value of x; then multiplying both sides of the expansion given 
in Theorem 9.3 by Q,(x) and substituting some specific values for x into 
the identity thus obtained it is possible to get simple equations for the 
unknown coefficients. The method is often useful when Q,(x) has only 
simple real roots, substituting the values of the roots for x we arrive at 
simple equations for the unknown coefficients. 

To illustrate these methods we use some specific examples. 

3x? — 6x +2 


Examples. (1) Split the proper rational function ———_,—-— into 
x” — 3x* + 2x 














Teese 


partial fractions. 
ча On factoring the denominator, we get 
(x — 3x? + 2x) = x(x? — 3x + 2) = x(x - (х - 2). 
Clearly, the denominator has real simple roots. Then we can write 
2 = 
3x 6x+2 А " B i: C 


Xx*-3x 2x  -Xe1 yoo. 





Multiplying both sides by the denominator, we obtain the identity 


3x? — 6x + 2 = A(x — I(x — 2) + Bx(x – 2) + Сх(х 1) (+) 
or 
3x! — 6x +2 = (А + В + C)? + (-3A — 2B – С)х + 2A: 
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Applying the method of comparing coefficients, we arrive at the system 
of linear equations in the unknowns A, B, C 


x | А-В-0-3 
x! -34-28-0--64, 
x? 2A 22 


whose solution yields A = 1, В = 1 and C = 1. 

We obtain the same result by substituting the values of the roots of 
the denominator for x in (*). Indeed, the roots are x; = 0, x» = 1 and 
X3 — 2. Then the identity yields 


2=2A and А = 1 for x = 0, 





—1= —B апа В = 1 for x = 1, 
2=2C and C= | for x = 2. 
Hence 
3x7 -6x+2 1 1 1 
E D tf ed ae ЭР TE — = ». 
x? — 3x? + 2x oe p 77:27 
x!'-3x41 





(2) Split the proper rational function --; 


into par- 
x + Зх“ + 3? + x? 


tial fractions. 
ча The denominator can be factorized as 


х? + 3x4 + 33 + x? 2x) + 3x? + 3х + 1) = (x + 1). 


Thus it has the repeated root xi = 0 of multiplicity 2 and the repeated | 





root x. = —1 of multiplicity 3. Then 

3 
зэс eM н Бар кой. pL cns. TERT эн 
X + 3x44 Зх? 4x x er xtl (-41 (x + 1) 


Multiplying out, as before, we get 


х + 3х + 1 = Ayx(x + 1) + A(x + 12? + Bix?(x + 1? 
+ Box?(x + 1) + Вех? (жж) 
Or 
X) + 3x + 1 = (Ai + Bi! + (3А, + А + 2B, + Bx? 
+ (3A; + 342 + Bi + В + Вз)х? + (А, + 3A2)x + Ad. 


The method of comparing coefficients gives 


Х | Ai-B-0 

х ЗА, + Аз + 2В; + В = 1 

х? ЗА: + 342 + Bi + B + Вз = 0 |, 
x | А +342 = 3 ` 

х? | A521 
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so that А; = 0, А = 1, В, = 0, В, = 0, Вз = —3 and 
х + 3x41 1 3 


х + 3хё + Зх x x (+1) 





Proceeding as in the previous example, ме put x = 0 and x = –1 in 
the identity (++). This yields Az = 1 for x = 0 and By = —3 for x = - I. 
Substituting these values for 42 and B3 into (**), we get 


X + 3х + 1 = Аүх(х + D! + (x + Dy + Bix? + 1y 
+ Bx? (x + 1) — 3x” 


or 
х + 3x4 1 — (x 1) + 302 = Axl + 1)? + Bix?(x + 1)? 
+ Box*(x + 1), 
whence 
0 = Aix(x + D? + Bix?(x + D? + Box*(x + 1) 
and 


Ai(x + 1)? + Bix(x + 1) + Вх = 0. 


Put х= 0 and x = —1. Then A; = 0, B» = 0 and, consequently, 
B, = 0. Thus we have the same values for the coefficients as those obtained 
by the method of comparing coefficients, namely, 


Aı =0, Æ = l, Bı = 0, В = 0, B = – 3. в 


: ыз, хх 1 
(3) Split the proper rational function азу into partial frac- 
x 
tions. 
“4 The denominator has по real roots since x? + 1 does not vanish for 


any value of x. Then there must hold 
X++] Мх-М Мх + № 
Se ior beana 
(x^ + 1) х +1 (x^ + 1) 
Whence 


4x7 + 1 = (Mix + N)Q + 1) + Мәх + № 





or 
x? + х®+ 1= Mix? + Nix! + (М, + Mix + (М, + №). 


Comparing the coefficients of equal powers of x, we get 
Mi-L,N-LMi-*M;-0OMNI + № = 1. 
Whence 
М-1,М-1М-1-1 № = 0 
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and, consequently, 


x-xl-41 х +1 х 


„> 
(х? + 1)? х2-1 (+1) 





It is worth noting that sometimes it can be easier to arrive at the desired 
result without applying the method of comparing coefficients. For instance, 
in example (2) after a little algebra we obtain 








20 X £ 3x & d _ GP + 3x7 + 3x + D – 3x? 
х? + 3x4 + 3x? + x? x(x + 1) 
IO ot) ae) 2-1 3 
хх + 1) хо œ+’ 


Integrating partial fractions. A rational function can be uniquely 
represented in the form of a sum of a polynomial (a zero polynomial in 
the case of a proper rational function) and a proper rational function which 
can be splitted into partial fractions. Since any real polynomial can be easily 
computed by applying standard integration formulas for elementary func- 
tions, integration of a rational function becomes integration of a sum of par- 
tial fractions. So now we turn our attention to techniques of integrating 
partial fractions. 

Using standard integration formulas we can easily integrate partial frac- 
tions whose denominators are monic linear polynomials or power functions 
of them, namely 


| A%-4 | 46-2. - Am|x- a] + С. 
x-a х- а 








А ах -k A -К+1 
——— =A {(х-а) *4(х-а) = x-a +C 
КЕТЕ = 1S, ЕС 
= ED 
(1 — k)(x — a) 
To integrate a partial fraction of the form MEX Ed we apply the 
x + рх+ 4 


method of completing the square in the denominator. This yields 


2 2 

2 2 P (Р - {Р 

spiega [x emp (5) | +4 (5) 
2 
р "LR 
Эм) 


Since the second summand is positive we set it equal to a? where 


2 
а= + jq- T . Then tie substitution x + CE t gives dx = dt and 
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x! «px + q = 1? + a’. By virtue of the linear property of the indefinite 
integral we have 


u(- 7) +N 

Mx+N х2 2 

Saas тулах А + Sg ge ít 
х + px+q і ra 


ЫН ий _\ (~- | d. 

te +a? 2 +a 
ae" + ary a у + (w- 52 220 
xa 2 1 + а° 


In(t? + a?) + (х- 22); tanc! +С 
a a 








In( + px + 4) + Жез, dán 
У 44-р V4q - р? 


Р Ес = 


we work through | dx 
x 
“4 Clearly, the quadratic polynomial х? + 4х+ 6 has no real roots since 


2 
T -q = –2 < 0. Completing the square, we get 


х + 4х + 6 = (x) + 4x + 4) + 2 = (x + 2Y + 2. 


The substitution x+2=f yields dx=dt, x=t-2 and 
x? + 4x + 6 = г + 2. (Observe that here а? = 2.) Then 


eee дез [2462 dt 
х + 4х+6 t^-2 


-4| а -3| миа _ 
+2 2 ){?°+2 
cool tinto E esL 7 276 46 
v2 2 2 Е 


Saa Es ~lingg+ 4469407 
v2 2 








To compute the integral of a partial fraction of the form 


PaA (k 2 2) where the denominator is irreducible to a 

(x^ + px + q) 

product of monic linear polynomials we proceed as before by putting 
2 


x += t so that dx = @ and a= q- Æ. Then 
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Mx +N Mt + (N- 5р. 
MEEN os oie — dt 
| (х2 + px + q)* | (2 + а?) 





_ М о ouk у 2 _ Mp dt 
eso ae +a) + ( 2) I 


Е M _ Mp dt 
с 20 - ky? + a] Ё (~ 2 )| (^ + а?) | 


We denote the integral оп the right by Ј and express it as 


2 2 2 
1 | gu) 1 u 


Jk = | . at puni 2 R 
(2 + а?)“ а? (£? + а?) 


1 do č 1 Cdt 
(1? + ay a? (12 + а?) 














а 











1 1 d(t? + а?) 
2,dk-i—- VE OS р. 
$5 әд? | (12 + а?)“ 
To compute the right-hand integral we apply integration by parts by 
2 2 2 2 
putting и = ¢ and dv= ша Lr so that du = dt and v= Че к, = 
(1^ + a^) (t^ a^) 
РЕГ East RED 
(1 — Ki? + ay! 
Pay 1 t p ud dt r] 
Cg 7 |а Юи? a 1i-k | (y^! 
1 1 t 1 
= 5- Jk-1 7 ou [ee рч + 1-4к- 
а? k-1 2a F PA ky? + ay k Е 1 k | 
or 


t 2k — 3 


——-——— Јер. 


J = -— ——- =- -a= -- SEL EE ACIE n Vi Va aa: Кагана 29 
57 24 (k- 002 + а? 2420 — 1) 





Thus we have arrived at the recurrence relation which yields the value 
of Jx for any k (k = 2, 3, .. .). Indeed, the integral J; is easily computed 
by using a standard integration formula, 1.е., 


A= | dt lacia 





Ücgq a a 


For К = 2 the recurrence relation gives 





зоо а t QUÀ 
(12 + а?ђ? 2а? (t? + а?) 2а? 

2 t 1 -14 
uua a ute 
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Then we can easily compute Л by putting k = 3 in the recurrence rela- 
tion; continuing this process we can find the value of J for any preassigned 
К. Substituting the relations for / and a into the expression obtained for 
a given К we finally represent the result in terms of x and M, М, p and 
q involved in the original integral. 


(х2 — 4x + 5) 
“4 Clearly, the integrand is a partial fraction similar to that considered 


Example. Compute the integral | er —- — dx. 


above since the denominator has no real roots due to the fact that 
2 
Р. q = —1 < 0 апа, hence, is irreducible to a product of monic linear 


polynomials and the nominator is a linear polynomial. Then completing 
the square, we have 


х2 – 4х +5 = 0? – 4х + 4) +1 = (x-2Y +1. 
The substitution x — 2 = t (а? = 1) yields dx = dt and x = t + 2 and 


| х +1 ax = | +3 a=; | 21 dt 








QË – 4х + 5 XP 3-1 2 J] +y 
TE NE 224 
(£? + 1)? 2012 + 1) (12+ 1)? © 
Put & = 2. Then 
dt t 1 dt t 1 -1 
= + A + ас 1-0 
| (^1) 202441) 2 | (2-1 28-41) 2 
Непсе 
х-1 1 3t 
— 5 dx= – + : 
| Qi-4x45p ^ 20141) 20241) 
3 -1 3t == 1 3 24 
`t t + С = — + 5іап +С 
ат 202-1) 2 


Finally, the substitution x = £ + 2 gives 


xctl 3x —- 7 3 =i 
iyt 202 — 4x +5) + ап (2-2) + C» 
To summarize the results outlined іп the preceding discussion we quote 
the following theorem. | 
Theorem 9.4. The indefinite integral of any rational function always 
exists on intervals where О„(х) з 0 and is expressed as a composition of 
a finite number of elementary functions, namely, as an algebraic sum that 
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can only involve polynomials, proper rational functions, logarithmic func- 
tions or arctangents. 

It is instructive to finalize the general procedure of integrating rational 
functions as the following sequence of steps: (1) express a given rational 
function (an integrand) if necessary as a sum of a polynomial and a proper 
rational function by applying the division algorithm for polynomials or 
any other suitable technique; (2) factor the denominator of the proper ra- 
tional function into linear and quadratic polynomials; (3) split the proper 
rational function into partial fractions; (4) compute the integral of the given 
functions as a sum of integrals of the summands obtained. 

In conclusion of this section we shall consider a number of examples. 

Examples. Compute each of the following integrals. 

х 

0 u^ 
-4 Since the integrand (x — 1)(x? — 4) = x? — x? — 4x + 4 is not a proper 
rational function we apply the division algorithm for polynomials 

1 z 
x! — x? — 4x + 4)x? + Ox? + 0x + 0 
x!- 1x! - 4x —- 4 

х? + 4х — 4. 








Hence, ће integrand becomes 


ae x + 4х – 4 


тота 4 uw ay! 


where Ro(x) = 1 and P(x) = x? + 4x — 4. 

The denominator of the proper rational function has three distinct roots 
a=1, bh = 2 and с = —2, then this function can be splitted into partial 
fractions as 

x'*4x-4 _ A B C 


(х= 002-4) x-1 trel MD 








Whence 

x! + 4x — 4 = A(x? — 4) + B(x - D(x + 2) + C(x - (х - 2). 
Setting x equal to the values of the roots, we obtain 

A = – 1/3 for x = 1, B = 2 for x = 2 апа С = -2/3for x= -2. 
Then 


АА а а мы. 7 
(x — DG — 4) 3 x-1 x-2 3 








28— 9505 
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and 








x? dx b. E. E 
pol e lees EU. . шаг, d 
|o em (1 3 x-1 К х-2 3 х+2 ^ 
-x-im|x-1|*2m|x-2|- 2in|]x 2| £ С > 
2 
1 
2. отл сыр 
o [3 


ча The integrand is a proper rational function whose denominator has dis- 
tinct roots x = 0 of multiplicity 1 and x = 1 of multiplicity 3. Then splitting 
into partial fractions yields 
2 
x +1 Ai Аз Аз В 
= 21 + + + z 
x'—x x ХЭЭЛ aee] 





Multiplying out and dividing by the denominator of the integrand, we 
arrive at | 
x? + 1 = Ах (х — 1) + A(x — Dx + A(x — 1) + Bx? 
ог 
x! + 1 = (Ai + В)х? + (— А, + Аз) + (— А; + Аз)х — Аз. 
Comparing the coefficients of x of equal powers, we get А, + B = 0, 


-А + 42-1], -42-c 435-0 and -– А; = 1. Whence A, = -2, 
Аз = —1, Аз = - 1 and B = 2. Then 

2 

xb. :2525..1 Lbs si 2 


at cy x x x x-1 


х* +1 Е 2 1 1 2 
Е 20507205 
| _+2injx-1]+C> 
X 





and 








= —2In|x| +1+ 7 
(3) | E зз. dx. 
G^-1 ^ 
~ Since the denominator has no real roots the integrand can be written as 
x!-x Mix + № Мх + № 


Так al) о) 
Then 
x — x = (Mix + Му)? + 1) + Мх + № 
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Or 
х= x = Mix! + Nix? + (Mi + М)х + (Ni + №). 
Comparing the coefficients, we have 
М, = 1, М, = 0, Mi*M;--1], №+ № = 0 
so that 
M,=1, М-0, M--2, М=0 


and 
2 х= х х 2х 
BRL ORE dx = 4 
| ais T Ї 524 geyl 


1 1 
= zme +1)+ Spt С. 


+ 1 


Observe that the integrand can easily be splitted into partial fractions 
by using simple algebra as 








хУ—-х | (Qà-x-2x | xoà + 1) - 2x 
(x? + 1)? Q + 1)° (x? + 1)? 
x 2x 





etl +1?” 
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Rational and irrational functions. We consider functions in many 





variables ит, u2, ..., ик. Let a function R(m, 12, ..., ик) be represented 
in the form 
Pm(ti, U2, ..., Uk) 
R(u,,... ик) = oe i ы ; 
( : » Uk) О„(и\, U2, ..., ик) 
where Pm(t, 15, ..., ик) and O,(u1, U2, ..., ик) are the mth and nth 


degree polynomials іп ui, 10, ..., ик, respectively. Then R(t, 10, ..., 
ик) is a rational function in 24, 25, . . ., uy; otherwise, it is irrational. For 
example, a quadratic polynomial in variables иг and uw takes the form 
Р, (i, u2) = Ао + Alou, + 40102 + Ази? + Анин) + Ao us where the 
coefficients Aoo, Ato, Aor, 420, Aii, Ao? are real numbers and A2, + 
Aii + Аф = 0. He 

: х t 2y + ху 

It is easy to observe that f(x, y) = FIN eem 
in variables x and y since this function is a ratio of the third degree poly- 
nomial Р(х, у) = x? + 2y? + xy and the fifth degree polynomial 
Vx? – 2ҳу + 3 


Os(x, y) = x ху + 1, while Ах, у) = <a aE is an irrational 


is a rational function 


function. 


28* 
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Suppose that variables ui, u2, . . ., ик are some functions of a variable 

x, ie, иу = fi(x), u = р(х), ..., ик = fx(x). Then the function А [л (х), 

Р(х), .. - Л) is a rational function in functions fi (х), A(x), . . ., fx (x). 
х2-мХ txt 1 


For example, f(x) = 
x+14+3VX 4x41 
x and in Vx?+x+1 so that f(x) = R(x, Vx?+x+1). It is worth noting 





is a rational function in 


2 
I Vx, ЖЭТ rt : 
that f(x) - 8032 is an irrational function in x and in V x? + 1 
sin x 


while it is a rational function in functions In x, eVX' *! and sin x? so that 


f(x) = Ёйпх, еу *!, sin x’). 

It is not hard to notice that sometimes integrals involving irrational 
functions do not admit representations in elementary functions. For exam- 
ple, the integrals 


2 
| ur and | шах (0<k<1) 


Xa —x?)(1 — kx?) Xa —x?)(1 - &?x?) 


called the e//iptic integrals of the first and second kinds, respectively, can 
not be expressed in elementary functions. However, by suitable substitutions 
it is often possible to reduce integrals involving irrational functions to those 
of rational functions. Below we shall consider techniques applicable in deal- 
ing with integrals of some specific forms. 








Integrals of the form \к(® m att) dx where m > 2, R(x, y) 
Я сх + а 
is a rational function in x and у = m SE and the coefficients a, b, c 


and d are real numbers such that ad — bc = 0; we leave aside the case when 


ad — bc = 0, for then 29 and -2* + Dd ob so that the integrand 


d ox+d c d 
becomes a rational function only in x and its integrals have already been 
studied in detail. : 

The substitution 


iz ax + b 
AL cx +d 


m axt+b 
сх+а” 





yields 


(сх + dy" = ах + b and ах – cxt" = dt^ — b 
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so that 
dt" — b 


y= 
a — сі" 


is a rational function in /. 
On differentiating x with respect to f, we have 
dmt" (а — ct") + cmt" аг" — b) dt 
(а – сі")? 





ах = 


and further 


(ad — bc)ymt" ^! 
(a — сі")? 


[R(x n жї ae 
A сх + а 


т m-i 
_ R( 2 mn) mi" "(ad — bc) а = \кцо, 
a – сі" (a — ct") 


dx = dt. 





Then 








m 
where Ri(f) is a rational function in ¢ the factor R (2-3 , ) being 
a function in a rational function is again a rational function and hence 
the product of this factor by the second factor which is a rational function 
in £ is again a rational function in f£. 

Therefore we have reduced the original integral to that involving rational- 
functions which are well familiar to us. 
Let 


(Roat = Е) + C and F'(t) = Ri(f). 


Then the original integral becomes 


ax+b Ш ах + b 
(а(х M сх + 2) ё F(x cx + 2) De 
By way of illustration we consider two examples. 
Examples. Compute each of the following integrals. 


(1) | 4 2-3 ах 
2x 3. Qx e 3 ` 
ча Observe that the integrand is a rational function in x and in 


| 12x-3 , = 
у= аруусу, e R(x) = 








y 
(2х + 3)? ` 
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2-3 mM 4, 3 1+ _ 
Put 6-743: Then 2x 3 = 2xf + 37, 729 12 


3 
3 223) сл ааф ea е é -. Hence the 
2Х1-1 0-1) 1-1 


original integral becomes 


2x - 3 dx - 4—2 1247 
N2x £3. Qx +3) 36 a - ey 














- dt 





5 
1 4 1 5 1 2x — 3 
= = t um » 
(еа = Бс x сүз) +С 


(2) | го 
Ух Т. Ух + Vx) | 
4 Evidently, the integrand is of the form R(x A). Put ¢ = Vx. Then 
х= 12, Vx = P, Ух = C, Ух = (6 and dx = 1214. Hence 


Ix dx -| tdt — | Га 
З 7034 & 12 таат в 
Vx(Vx + Vx) ee +t) 


2f (-031д- o | (e -1e 313 at 
i+? n 


Ü 
-n( -t + tan- Ч) +C 


= 4¥x — 12'Vx + 121ал7! Yx + C >» 


Integrals of the form fr ( х, Мах? + bx + c ) dx where the integrand 
expressed as R(x, y) is a rational function in x and in y = Vax? + bx + c. 

By suitable substitutions dependent on the coefficients a, b and c these 
integrals are reduced to those involving elementary functions. Three cases 
are to be distinguished. 

(i) Let a > 0. 

Then the substitution 


= Мах? + bx + с + Мах 








yields 


(t — Vax)? = ax? + bx + c and 1? — 2Vaxt = bx + с, 
so that 
EE. 


2Vat + b 


is a rational function in f. 
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On differentiating x with respect to 4, we get 


2t(2Vat + b) — (t? — су ха dt 22 Vat? + bt + cva 


e (2Vat + by. (QQVat + by 








We also have 


2 _ 
Мах + bx4c =t— Мах = 1 Ya 16 


Wat + b 
Vat? + bt + ceva 
Wat + b 


Thus x, dx and Vax? + bx + с are all rational functions in / and we 
can write 


|^ (s Мах? + bx + с )* - [nons 





where 





-c a e; Vat? + bt + eva 


1 (2Vat + by. 


t 
R H =R » 
i (3555 2Уа! + b 


is a rational function in f. 

Observe that the substitution / = V ax? + bx + с — Vax also reduces 
the original integral to that involving rational functions. 

Example. Compute the integral 


| dx 
Nx! + о? 
202,5 
4 Since а= 1» 0 we put = Vx? + o + х. Then х-4-п уу» 


2 2 2-52 2 2 
dx = LT di and Уд + а? = 1 - 8 = EEA so that 


dx 2t 2-6 dt 
1—2 | P+ 27? | t 4 
=ш|х+\ух? + o? | «C. 
Noting that x + Vx? + o? > 0 for all x, we finally arrive at 


dx 2 2 
= In| (x+ VŽ 4+ 07) | 40 
uta 


Problem. Prove that 


dx 
= Inf х 4 Nx?- o? | «G |x| » |a. 
|a 
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(ii) Let the quadratic trinomial ax? + bx + c have distinct real roots 
хі and хэ (let the coefficient b be of any sign). 
Then the substitution 


Мах? + bx + c -(x—x)t (or Vax? + bx * c = (x — x)t) 
yields 
a(x — xiXx — x) = (x — xit? (or a(x — x) = (x — xt) 


since ax? + bx +c = (x — x yt. 
Whence 


xt? – ax dy = 2460 — ХӘ a, 
= 2 2 


x= : 
12-а (t^ — a) 


and 


225 — 
Vax + bee = (5708 - a)i 9m 


t-a 2-а 


Since x, dx and Vax? + bx + с are all rational functions in f the origi- 
nal integral becomes 


|^ (s Мах? + bx + 3r - [non 


where 





хи? -an  a(u-x)t \ 2a(o – х) 
t-a t^-a 


к= R( , (t? — ay 


is a rational function in f. 
Example. Compute the integral 


| dx СО 3 
x-2N1-x 











44 Since 1 — x? has two distinct real roots x, = —1 and x; = 1 we put 
Vi-x =(+xt (or V1- 2 = (1- xy). 
Then 
Е ОО SEE. 
1+{° 
2 
x UNE EL M pee = а зас. 
2-1 1-1 (1 + t^) 
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and 
dx (£? + DG? + D4tdt. _ 2 dt 
(x — 2N1- (31° + D2( + 1)? 32 + 1 

2 dt 2 21 

= Е tan 31+ C 
3 a 2 1 УЗ д 

= _2_ -1 l-x » 

SAEC tan 3 dx + С. 


Gii) Let c > 0. 

Then the substitution Vax? + bx + c = xt + Ve (or Vax? + bx + с = 
xt — Vc) easily reduces the original integral to that involving rational 
functions. 

It is worth noting that it suffices to apply the substitutions in (i) and 


(ii) to reduce integrals of the form {RO Мах? + bx + с ) dx to those 


involving rational functions. Indeed, if b? — 4ac > 0 then the roots of 
ах? + bx + c are real and the substitution given in (ii) yields the desired 
result. If b? — 4ac < 0 then the sign of ax? + bx + c coincides with that 
of a; since ax? + bx + c must be positive, then a > 0. In this case the sub- 
stitution shown in (i) leads to what we are seeking for. Notice that some 


integrals of the form {А (x Мах? + bx + с ) dx can easily be computed 


without applying the substitutions mentioned in (i)-(iii). Below we shall 
consider three specific cases. 


dx 


Мах? + bx + с 


suitable to complete the square of the radicand as 


а з“ 0. Here it is 





(a) Integrals of the form | 


addo ке 
2a a 


2 2 
= a| (x + 2x + 2 pE сн 
4а а 4a 
2 
lea + 5с-Ь | 
4а 
2 2 
4ac— b? _ b 
1E. г) + 24 - (+2) + р, 








Дас — b? 


where p — Ja 
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Then the substitution Г = x + -Ё yields dx - аг and 





| ах -| dt 
Мах + bx*c Nat? +p 


where а and p are of opposite signs or both positive. 

When a > 0 and p > 0 and when a « 0 and p « 0 the integral is ex- 
pressed by a logarithmic function; when a « 0 and p > 0 the integral is 
expressed by an arcsine as the following two examples illustrate. 


Examples. (1) | dx 


Ух -4х+5 ` 


-4 Since x? — 4x + 5 = (x — 2)? + 1 then the substitution x — 2 = t gives 
dx = dt and 





Р ах -| dx -| dt 
Мх? -4x + 5 Мех = 202 + 1 М? + 1 
= In(t + +1) +С = (х2 + Ух? – 4 + 5 +С т 


0) |= 
N 6x — x? 
4 Since бх — x? = —[(x? — 6x + 9) — 9] = 9 — (x — 3)’ the substitution 
X — 3 = t yields dx = dt and 


(= = | cie C=sin XL ce 


мМбх х? м9 1? 3 3 


(b) Integrals of the form | Mx + N 


Vax + Ьх + с 


(ax? + bx + c)' = 2ax + b, we reduce the original integral to that con- 
sidered above by proceeding as 





ах, аж 0. Noting that 





1 
Ч М zg [бах + b) - E] +N 


| Mx +N к= | 
Мах? + bx + с Мах? + bx + с 


ах 











- м | Qax + b)dx +(w- AI dx 
2а J Vax? + bx +e 2a Vax? + bx + с 
- 1 аы. (мм) | ах 

24) Уа + bx 4 c 2а Мах? + Бх + с 





=i ac + be +e +( -| a 


2a Мах + bx + с 
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Example. Compute the integral 


(22-а 


N6x- х? 


“4 Noting that (6x — x”)’ = 6 — 2x, we obtain 


x1 1 -25-2 4. 1166-20-86, 


meu E 2 |95225 


dx 1 | Hee) .-1X-3 fea 
=4 = 4sin —N6x'—x* + C. 
| Мєх-х 2 J Vb6x—x? 3 


(c) Integrals of the form | Tae 


nomial of degree n. Here we apply the method of comparing coefficients 
to find the coefficients of P,(x). 
Let there hold the identity 


1-20 — а = Оһ 1(х)М ах? + bx + с 


Jæ + bx +e 











dx where Р„(х) is a poly- 








+ An | ох ; (*) 


Мах? + bx + с 


where Оһ – 1(x) is a polynomial of degree (n — 1) with unknown coefficients 
Ao, Al, уг An-1; i.e., 


Qn-1(xX) = Ao + Aix +... + Аһ-1х” 1, 


To compute the values of the unknown coefficients we differentiate the 
above identity with respect to x so that 


Р) - = Qi i(x)Nax? + bx «c 
мах? + bx + с 
ax + b/2 + A, 


+ Qn - 1(х) 5 5 5 
Мах + bx +c Мах + bx + с 


Multiplying throughout by Vax? + bx + с, we get the identity 


Paix) = Qn- iGX(ax^ + bx + су + Qn- (x) (ax + 3) kA; 








which involves the nth degree polynomials on both sides. 

Comparing the coefficients of x of equal powers, we get (n + 1) equa- 
tions which yield the desired coefficients Ax (k = 0, 1, 2, ..., n). Then 
substituting these coefficients into the right-hand side of (*) and computing 
dx 


Мах? + bx + с 


, we arrive at the desired result. 





the integral | 
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Example. Compute the integral 


| хах 
Мх? + 2х + 2 
ч Put 
2 
| = (Ао + Ах)? + 2x + 2 
Ух? + 2х + 2 
+з] a ‘ (89) 
Vix? 42x42 


On differentiating both sides, we have 


Ae Pad 
Nx! + 2х + 2 


+ (Ao + Aix) х + 1 + А2 


М2 + 2+2 М2 + 2+2 — 
Multiplying ош by Vx? + 2x + 2, we get 


x? = Ai( + 2x + 2) + (Ao + Ах)(х + 1) + Ad 





or 
x? = 2А\х? + (241 + А + Ао)х + (Ao + 241 + A2). 


The method of comparing coefficients gives 








x? | 2А, =1 
х! Ao + 3А, =0 5 
х Ao + 2A; + Ar = 0 
Whence 
3 _1 _1 
Ao = —у, Ai Ty, 2 = 5. 
Computing the integral in the right-hand side of (**), we have 
| dx: 2 | d(x + 1) 
Ух + 2х + 2 М(х + 1? +1 


-dn(x 1 Мх? + 2+ 2 +C. 


Then the original integral becomes 


|p = 552 УЛ хаха. 
Vx? + 2х +2 2 


thine t М2 2+2) +C 
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9.5 Integrals Involving Trigonometric Functions 


Integrals of the form [A Gin x, cos x) dx where the integrand is a 
1 — 2sinx 
2 + cos’ x 

is a rational function in both sin x and cosx; g(x) = 1 + sin’ x 

Vcosx + cosx 
is a rational function in sin x and is irrational in cos x (we shall not examine 
functions of this kind here). 

X — 

23 

is reduced to that in a rational function. Indeed, 


rational function in both sin x and cos x. For example, f(x) = 


By the substitution tan t where т< x < m, the original integral 








x x х 
2 sin= cos% 2 tan 
: 2 2 2 2t 
CN x 2X x 3x 14D 
2 " 
OS — + sin^— 1 + tan^— 
кыр; 2 2 
2X - 2X 2X 
cos^— — sin^— 1 -tan^ = 
_ 2 2 2 1-4 
cos x = Г 25 gc pu us 
2 :42 2 
. 08:24 sin^— 1 + tan^— 
кш; 2 2 
x = 2ап^!1, dx = © 
1-1 


and 





: 2t 1-0CN 2а 
R(sinx,cosx) dx = VR : = |\R,(t) dt, 
[Re ) (225 БЭЭ БЧГ (во 


where Ri(t) is a rational function in t. 














Example. Compute the integral | dx . 
sin x 
ча The substitution tan? = t yields 
2 dt : 
ах 1+? dt x 
== = = — — = » 
| | J \4 а [+С Injtan 7| + C. 

140 


Sometimes the substitution tanz = t requires unwieldy computations; 


so we shall mention three specific cases when integrals of the form 
[RGin x, cos x)dx can be computed by using simpler substitutions. 
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(a) Integrals of the form [sin X)cosxdx. Here the substitution 
sin x = f gives cos x dx = dt so that the original integral becomes {RO dt. 
For example, if we wish to compute the integral 

| cos x dx 
4 + sin?x ' 
then the substitution sin x = ¢ yields df=cos x dx and 


cos x dx dt 1 -11 1 spl. 
= == tan +C=— tan = sin х} +С. В 
| 4--sin^ x | 4+ 2 2 2 G ) 





(b) Integrals of the form [R(cos х) ѕіп х dx. The substitution cos x = f 
yields sin x dx = — dt so that the original integral becomes — [A0 dt. For 


example, using this substitution we can easily compute the following in- 


tegral 
sinx ee = dt | d2-4r1) 
2 + cosx 2-1 2+1 


—]n(2 + t) + C = -In(2 + cos x) + С 





(c) Integrals of the form |К(зш х, cos х) dx where the integrand R (sin x, 


cos х) involves only even powers of sinx and cosx. Then the substi- 


dt 


tution tanx = ¢ gives х = tan! t and dx = рт so that sin? х and 


cos? x being rational functions in tan x become rational functions in f. In- 
deed, we have 











iis ume 222 tan? х PEE i 2 
cos” x + sin’ x 1 + tan’ x 1-1 
захыг E ЕЕ 1 == 8 1 2 
cos^x + sin^x 1+ tan^x 141 
so that 
2 
[Reins cos x) dx= (80 гүй 4 гыт) =з = (кю 


where Ri(f) is a rational function in t. 
To demonstrate how this substitution works we compute 





dx 
sin?^x + 4cos?x +2 | 
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2 
ч Put tanx=4f. Then go e. sin? x = — and 
1 1-1 1-1 
cos x = ———, so that 
141 
| dx -| 1 dt 
52 2 = 2 2 
sin’ x + 4cos x + 2 т ШЭНЭ, ie 
1-1 1-1 
1 а 1 25 102 1 -1| tanx 
- - їап + C = —- tan ---02) 800 
IES 3/2 v2 3/2 (>) 


Integrals of the form f sin“ x cos“ x dx where o and 8 are real numbers. 


We consider two cases when the integral admits a representation in ra- 
tional functions. 

(a) Let either а or 8 be an odd positive number. For definiteness we 
put 8 = 2k + 1 where К > Oisan integer. Then с is any real number. Using 
the identity cos? x + sin? x = 1, we have 


[sine хсо8 хах = [e xcos?^*!xdx- [i x (cos? x)* cos x dx 


= |sin? x (1 — sin? х)“ cos x dx. 
Put sin x = 4. Then cesxdx = dt and 
sr" xcos** +1 хах = [ro - 17) dt. 
Applying the binomial theorem to the integrand on the right, we obtain 


(k + 1) power functions which are easily integrable. 
Examples. Compute each of the following integrals. 


(1) (зіл? x cos? x dx. 


«~ Manipulating on the integrand, we have 


(sin? x cos’ хах = fsi? x cost x cos x dx 


(sin? x(1 — sin? x)*cos x dx. 
Then the substitution sin x = £ gives cosx dx = dt апа 


[s x(1 — sin? x cos x dx = [ra - ?y.at 
1 2 1 


— |2 _ 974 6 ~-liw_4,5,1 л 
- (в 2t^ + f°) dt 3! 5) +71 +C 


2 
3 


sin? x -i sin? x +1 sin’ x + C. 
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n3 
Q) E dx. 


COS x 
-« We have 
8-3 2:2 2 
sin x sinx . 1 — соѕ х . 
———5— dx = \ —_,— sinxdx = \——;—— sinxdx. 
cos“ x cos“ x cos’ x 
Put cosx = t. Then sinxdx = —dt and 





2 2 
== t sin x dx = | ao dt = 10 - т) 
Cc х 














Stith rC Costy + C. > 
t osx 
3 
(3) 18 X. dx. 
Asin x 
ча We have 
3 2 _ un? 
| бєх = | 3883 cos xar = | | = sin’x сохах 
Vsin x sin x sin x 


Put sinx = /. Then cosx dx = dt and 


-2 2 
22-54 cosxdx = | ! = dt = jure - уй 
sinx t 


= Vie? -Z 524 C= Weinx —2 (V sinxy + C 





(b) Let a and 8 be positive even numbers, ie, а = 2m and 8 = 2n 
where т and п are natural numbers. Here it is helpful to manipulate on 
the integrand by applying the trigonometric identities 


L= $08 28 and ee 1+ байы, (*) 


sin? x = 
Suppose that m = n. Then using the identities (ж), we obtain 
La x cos?" x dx = G X)" (cos? x)" dx 
2 1 — cos 2х Y / 1 + cos2x V 
Jes 


= — fa — cos 2x)"(1 + cos 2х)" dx. 





Applying the binomial theorem to the factors (1 — cos2x)" and 
(1 + cos 2x)" and multiplying the polynomials thus obtained we arrive at 
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the integrand involving odd and even powers of cos 2x. The terms involving 
odd powers of cos 2x are easily integrable as we have discussed in the 
preceding section. To integrate the terms involving even powers of cos 2x 
we apply to them the identity (*). This yields the terms involving powers 
of cos 4x. Continuing this process we finally arrive at integrals of the form 


[cos kx dx where k > 0 is a positive number. These integrals can be com- 
puted without difficulty. 
Suppose now that m = n. Then applying the identity 
sin x cos x -5 sin 2x 


we obtain 


2n 
sin?" x cos?" x dx — [en x cos х)?" dx = | ( sin 2x) dx 
= r3 [in 2x dx = 52 foin? 2х)" dx 


= HC = cos tr) dx = a fa — cos 4x)" dx. 








2 


Clearly, the integral on the right is easily computed as given above. 
Examples. Compute each of the following integrals. 
(1) jsin? x cos* x dx. 

за We have 


2 
(sin? xcost хах = | is cos 2 Е + 2228) 2 


== | “ог 2x — cos? 2x — cos? 2x) dx 





- 


| [1+ 2x— 1+8054х _ (1 —sin? 2x)cos | ах 
d es 
2 


cos 4x + sin? 2x cos 2x) dx 


sin 4x ++ sin? 2x) + С т 


кә 


(2) [sin x cos? x dx. 
< We have 


(sin? xcos?xdx = [cmn xcos x)? dx = ; [sin 2x dx 


= 5 (A= ae = 3 (x - 


i > 
1 7 sin ax) + С. 


Al 


29—9505 
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Integrals of the form [sin ax cos fx dx, [cos ax cos $x dx, 


sin ax sin бх dx where a # 8. To compute these integrals it is helpful to 
use the following trigonometric identities 


sin ax cos Bx = [sin(a + 8)x + sin(a — 8)x], 


cos ax cos Bx = [cos(a + 8)x + cos(a — 8)х], 


Nl) N| = N| = 


sin ах sin Вх = = [cos(a — 8)x — cos(a + 8)x]. 
For example, if we wish to compute [sin ax cos Вх dx then 


[sin x cos Bx dx = 3 6 + В)х + sin(a — 8)х] dx 





2 а + В а – В 


The other two integrals are computed in a similar way. 


Ж s|- cos(a + х _ соѕ(а — 9» | +C 


Example. Compute [cos 3x cos x dx. 
ча We have 


[os 3x cosx dx — т | 4х + cos 2х) dx 


fl. = opes 1727 
= (апас 3 адк) Ср sind +g sin2r+ С» 


Ехегсіѕеѕ 


Apply the standard integration formulas to compute the following 
integrals 


1. [^ Vxdx. 2. | Ta . 3. [e dx. 4. 6) dx. 


5. [reas 6. E dx. 1. | 245 qx 8 ак 








32 10" cos?x sin^x 


9. ana dx. 10. pu dx. 11. | савх - cot хУах. 


sin x cos! x sin 5x cos 2x 
12. : dx — ОЙ. m dx = sinh 2x 
cos^x sinx sinh^x cosh^x cosh x 
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3 2 ах dx 
15. [tanh xdx. 16. (со хах. 17. (чыъ Ы (=> eg 
19 IE 20 ise 
NV 4x7 + 9 N9 – 4x? 
Use integration by substitution to compute the following integrals 
3 
279 2 xX x ax 
21. Б ах. 22. Б sin түг de 23. Im dx. 24. 1-2. 
25 | ах | ах 21 | ах 
Ve (1 + Vx) Ve [1 — Ух Ne +1 
28. (87 ax. 29, (x20 — 1) dx. 30 dx зу, xx + Гах 
ө ех = 1 » . . е Ух 21. . е . 
3 2 
dx x 
32. hx dx. 33. \z 34. | ХӨХ с зе. [a 
Vx+1 V1 -x 1 + x4 M1 
36. | Intanx gy у | сох gy за. |-52Х tant gy 
sin 2x In sin x cos” x 
-1 (ааг!) 
39. [2 xe x 


14x? 


ах. 40. |—— PX, . qx 
x(4 + In* x) 


Use integration by parts to compute the following integrals 





E tan? x dx. 


| dx 
(1 — 2x) ` 


41. pena 42. [e dx. 43. [xsin axa 44. fa + х)е" dx. 
45. fa + xln2)2* dx. 46. G - x je dx. 41. fran" xax 
48. [rtan хах 49. [in ха 50. | T dx. 51. 
cos” x 
52. \cos (Іп х) dx. 53. [sin (In x) dx. 
Integrate partial fractions 
54. 2295 =S 565. 2.767 
5 + 2x 2 – 3x (3х + 5) 
ах х+ 2 
58 —— . —— —— dx. 60. Ia 
5% I3 x 
61. 


29* 


хах 
х + 7х + 13 ^ 
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Applying the method of comparing coefficients, compute the following 
integrals 


62. (ске © * | хийх 64, | 3x” - 2x - 4 








х? + 3x – (х + Ох + 1) (х = Dx? — 4) 


3 2 
65. [at ae 66. E +1 gx 67. ra 68. (2 +2х dx 
x^ x!-x (x 














+ 3) (x + D* 
dx x? - 3x dx dx 
69. | —— —-. 70. |--------4х.71. | —5——-—. 72. . 
35 + х“ | (x + D(x-1* 4) (г + 1) (= 
n. | XE. 
x 1 


Compute the integrals involving irrational functions 
14. К ШЕ qus СЕЕ ды 
(1 - x) 1+х х 
1 -x 
76. ——— - Ў 
FER EF. А | 1+х ШШ 
x 


] * X dx. 
1-Х 
79. БЭ dx 














4 
18. [5 o h 
Nx? — 4х Vax — х2. Ve- 4x — 5. 

$1. |. 82. а 83. Le 

Vx? – 4x + 5 М5 – 4х – х? ? —6х-—-1 

2 2 

84. [= dx. 85. ILL dx. 86. |ва dx 

М1 + 6x—x? VI + бх—х° М? + 2х + 2 


NT 2 5 6 
87. ухас ах. 88. [251 dx. 89. 4 ‚ 90. Їэ ах. 


N1-x Мх?+2 1-x? М1+х? 
Compute the integrals involving trigonometric functions 
ONERE E D а ЕБЕ БЕИ 
2 + cosx 5 + 4sinx 3sinx — 4cosx 


94. оо ‚95. | 32x — ay. 96, сы жаш 
5 +sinx + 3cosx 1 + sin^x 5 + sin*x—6sinx 








97. HERE dx 98. (со хах 99. [sins x dx. 
1 — cos x + sinx 
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<3 
100. (sin? xos? хах 101. [cost xsin” xax 102. | к 
cost x 


103. | sin?xcos?xdx. 104. [n xdx. 105. [sin x cos? x dx. 


2 
106. | Ф. 107, | Sera 108. s 
sin x sin X sin XCcOS X 





109. |= 110. (sin 5хсо8хах. 111. [sin xeos 5x dx. 
cost x — sin‘ x 


112. [cos 7х cos 3x dx. 113. [sin 15x sin l0xdx. 114. [os cos dx. 


115. (sin xsin 2x sin 3x dx. 


Answers 
1. 03x? + с. 2. dx 4 C. 3. аза С 4. X оха an|x| s С 











x x -x —Xx 
15 pe l (2) ko -2—-2 ee -cotx-tax+C 
In 16 In 2 3 In5 In2 


9. 2tanx+C 10. 2x+C 11. tanx- cotx- 4x + С. 12. tanx — cotx + C. 


13. —tanh x – coth x + C. 14. 2cosh x + C. 15. x – tanh x + C. 16. x – cothx + C. - 








17. Їчал!2х-с as -1ш1325:31-Сс 19, nV2+Vaer+9 +С. 
6 3 12 "2:3 
1.12 


3 
20. -sin^! х-С 21. e°% + С. 22. -соз 1-С 23. - V1- + С 24. In | In x | +С. 


`2 3 


25. 41п (1 + V) + С.26. -8 V1 - Ух + С.27.1п-У© +1 —1 + сов. In |e — 1| 
е+1 +1 


21 20 19 
ё+сС 9. 971, 6D .825-с 30. 21ап-! Vx — 1. 


3 D 
1.5 6 + 1) 3(4х — 3) + С. 32-10 (x + 1)2/3(2х – 3) + С. 33. – (2 + xy1-x! +С. 


34.5 tan ^! x?+C. 35.4 sin^! x'« C 36.4 In? tan x4 C. 37.1n | In sin x | +C. 38.5 gan хс, 


зо. È ean s + C. 40. In V4 + I? x + C. 4l. -(x + De + C. 42. AM I нас 
n 


43. 1 sin 2x = 5 cos 2x + C. 44. хе + C. 45. x 2' + C. 46. x/e^" + C. 47. хїап x — 


In VI +? +С. 48. 168 Dua x-Ix4C 49. xsinx + V1—-x +С 
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2 
50. xtanx + In|cosx| +С. 51. — Jy + xtanx + In[cosx| +С. 52. 5 (cos In x + 
sin In х) + C. 53. 5 GinInx — cosInx) + C. 54. In| 5 + 2x | +С. 55. 110 |2 - 3х| +С 
вс 57. 1С 58. tnc! X * 5 +с. 59, ама +2х+5 + 
6(3х+ 5) 8(1 — 2x) v2 v2 
liant XE] с 60. ам 2 2x2 + tan 201 46 61. in x13 — 
2 2 v7 v7 
dox 4: 2 хэ 1| 
= + С. 62.1п | х^ + 3x — 10| +C. 63. In ————— + C. 64. In| (x - 
48 Суў | | JT2x +1] | 
2 x х-1 х x!-1 1 
Е poe RE. n i-o 
Do? - 4)| +С. 65 z + In = + C. 66 ; +1 + С. 6 ura 
In|x 3| +С 68. bcm up. ЕБЕ елы ж ses 
3(x + 1y x1 х 2€ x4! 
70, — e “эхээс тыш С 
7. х +1 
72. lin |Æ | -Їаал!х Са. 1( x-1 + УЗ лап 2X*l).c 
4 х+1| 2 3 A xd 45 
14x мМїҮх-1 х 
14. +C 75. 241 x « In +С. 76. -+C 
J= Vitex +1 Lax 
3f 1- x V^ Pag. -1х- 2 
TI. Tah Fig +C. 78. In|x- 24 x'-4x | «c 79. sin 3 +e 


80. In | x - 2 + V2-4x-5 | «C 81. n(x - 2 + V£- 4x + 5)«C 82. sin! 242 
8. Vx – 6-1 +4 | х- 3+ Мю 6-1 | «C 84 201 + 6х x? + 





4snc! 23 46 85 — -Viséx- x +C 86 Wee mw+24+C 
У10 
2 4 
87. 21-32 + sin“? x&C. 88. x x32 -In(x «x? + 2) + C. 89. rtm. 
4Ài-x «c 90. (6-5 +1 ын) vs +? --кш(к+у +1)+с 
2 5tan* +4 1 212n7 - 1 
91. Iní 3 + ta X | «C. 92. tan ——— +С 93. iIn| —_-*_] «C 
2 3 3 5 x 
2tanz + 1 


21215 +1 1, 5-sinx 
tan! + C 95. In + si? х) + С 96. >In ——— — +С 


2 
38 415 415 4 1 - sinx 
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x 
tan = 
97. -x+ 2ln — —2 |с 98 sinx- six + С 99. -L cos" x + 2 cos? х 
1 x 3 5 3 
+ tans 


cos x + C. 100. | sin? х 1 лух + С. 101. lcosx-lecosx + C 102. Ел” 
3 5 7 5 3 cos! x 


1 x sin 4x 3 TI 1. x 
FOX + C 103. 8 32 + C. 104. вх 34 3n 2x + ao sin 4x + C. 105. 16 








; 23 
sináx _ sin 2x , © 106. -cot x = у cot? x + C 107. - cot? x + C. 108. tanx + 


64 48 
т 
їап (: 1) 


111. --6x , ©0з4х ‚с уу sine ү SEH усан. - 





__соѕбх | cosáx , ү 


+С. 110. 12 8 


junx- 2 cot 2x + C. 109. -} In 














sin 25x 4 Sin 5x +С. 


20 8 50 10 





3:25 -Х cos 6x cos 4x cos 2x 
114. z sing x + 3 зіпе + C. 115. 24 16 8 + С. 


Chapter 10 


Integral Calculus. 
The Definite Integral 


10.1 Basic Concepts and Definitions 


Area of a curvilinear figure. Suppose we wish to compute the area 
of the plane figure aA Bb bounded by the graph of the positive-valued con- 
tinuous function f(x), the x-axis and the vertical lines x = a and х = b 


(Fig. 10.1). 
Let us divide (or partition) the closed interval [a, b] into n subintervals 
by choosing points xo, хі, X2, ..., Хи such that 


M HACK <<... Mi € X, = Б. 


























ai i l 
0| r-a& ХЕ, х, 1; Ina En 1-5 I 








Fig. 10.1 


Next we take an arbitrary point £y (xx 1 < ёк < xx) in each subinterval 
[хк - 1, хк] and construct и rectangles with bases [xx — 1, xx] and hights f(&), 
К = 1, 2, ..., п. The area of the Ath rectangle is 


ДО, = f(E)Ax«, 


where Axk = Xk — Xx -ı is the base length. 
By doing so we obtain a polygonal figure made up of n rectangles; its 
area Q, is equal to the area of the union of these rectangles so that we 
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can write 


О, = fi&)Ax + /@)Аж +... + ДБА = Ў fix. 
К-1 


Clearly, if we divide the interval (а, b] into smaller subintervals the 
number of subintervals increases and the lengths of these subintervals be- 
come smaller so that the polygonal figure thus obtained comes closer to 
the plane figure aA Bb. 

Let А = max Ax, be the largest of the lengths of.the subintervals 


1I«k«n 
[хк-1, Xx], К = 1, 2, ..., n. Evidently, the number of subintervals tends 
to infinity and their lengths Axx tend to zero as А > 0 since 0 «Ах < А 
for all К = 1, 2, ..., n. 
Suppose that there exists a finite limit Q of the polygonal figure as 


A- (тах Axx > 0. Then this limit is said to be equal to the area of the 
&K&n 


plane figure aABD, i.e., 
О = lim О, = lim >) f(&)Ax.. 
470 Х-04-1 
This limit, if it exists, must be the same for any collection of subintervals 
[хк - 1, хк] and any choice of points £ in the subintervals. Thus computing 


the area of the plane curvilinear figure aABb becomes equivalent to evalu- 
ating a limit of the form 


n 
mally 22 ft. 
1<К<л 
Distance travelled by a particle. We wish to find a distance S travelled 
by a particle during a time interval from ¢ = £o to t = T provided that the 
velocity v of the moving particle is described by the function v = f(t). 
Let us partition the interval [/0, 7] into n small subintervals by choosing 


points fo, її, б, ..., їл such that 


lo«ti«b«...«t,- T. 


Suppose that the velocity v(f) remains nearly unchanged on any subinterval 
[fk - 1, fk] so that it may be thought of as being equal to the value of v(t) 
at Tk € [fk -1, &], ie, v = Д), К = 1,2, ..., n. Then the distance travelled 
by the particle during the time interval Afk = [fk — fx 1] is nearly equal 
to Sk = f(rx) Atk; hence we can approximate the distance 5, travelled by 
the particle during the time interval [fo, T] as 


Sn = $1 + $2 +... + 5л = f(n) Ali + f(r2) Ab 


+... + f(t) Аһ = Y JG) Atk. 
k=1 
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We denote the largest of the subintervals Af, k = 1, 2, ..., n, by 
h= max Мк. The number of subintervals increases infinitely and their 
&SKS&n 


lengths tend to zero as А > 0. Evaluating the limit of S, as А > 0, we obtain 
the exact value S of the distance travelled by the particle from to to T; 
so we write 


S — lim У) Атк) Atk. 
AVVO к=] 


Thus we have arrived at the limits of the same form as before; if these 
two limits exist they are called the definite integrals of the functions f(x) 
b T 


and f(t) denoted by the symbols | Лх) ах and | S(t) dt, respectively. 


a fo 


Definition of the definite integral. Let f(x) be a function defined on 
the closed interval [a, b] (a < b). Proceeding as before, we divide [a, b] 
into n subintervals by choosing points 


X) —a«x «X «... € X»- 1X X, = D. 


We call the collection of subintervals of [a, b] the partition of [a, b]. 

Let Axy = xx — Xk- 1 > 0 be the length of the Ath subinterval and Ёс 
be a point of the kth subinterval for each К = 1, 2, ..., п. A collection 
of points ё, Б, ..., & is called the selection for a given partition of 
(а, b]. Given a partition of [а, b] and a selection for this partition we 
can evaluate the sum 


Sn = ХЕ) Ах + ЛЬ) Ae +... + Л) Ax. = 23 ЛБ Ах, 


where f(x) is the value of f(x) at the point £x € [хк-1, хк]. This sum is 
called the integral (Riemann) sum for f(x) determined by the given partition 
of (а, b] and the given selection for this partition; hence a value of S, 
is dependent on the partition of [a, b] and the selection for the given parti- 
tion, i.e., it depends on the way of dividing the interval [a, b] into subinter- 
vals [xx - 1, хк] and choosing points £ in these subintervals, A = 1,2, ..., п. 

Let А be the largest of the lengths of the subintervals [xx - 1, хк], К = 1, 
2, ..., п, ie, \ = max Ах, We say that a number J is the limit of in- 

n 


I«k«n 
tegral sums >, f(£k)Axx for f(x) on [a, Б] if given any number є > 0 there 
К-1 
exists a number 6 > 0 such that 
n 
| Уюл - 1| < 
k=1 


for every partition of [g, b] with Ах, < ô, k = 1, 2, ..., n, ie, for every 
partition with А < ô, and every selection kx, k = 1, 2, ..., n. 
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In symbols, we write 
n 
J = lim У) АЕ) Ах. 
э0К-1 


Notice that the number ô depends on the value of €; to signify this 
we shall sometimes write 6 = 6(e). 
If given any partition of the closed interval (0, b], a < b, and any selec- 
n 


tion £&, k = 1,2, ..., n, the integral sum >) f(x) Axx has the same finite 
К-1 
limit J as А > 0, this limit is called the definite (or Riemann) integral of 
b 


fœ) on [a, b] and denoted by | f(x) dx. Thus by the definition of the 


а 


definite integral we have 
b 


J= | Ло) dx = lim Ў) ЛЫ) Аж. 
Х-0К-1 


а 


The numbers a and b are called the lower limit and the upper limit 
of the integral, respectively; x is called the variable of integration. The func- 
tion f(x) is called the integrand and the expression f(x) dx is called the ele- 
ment of integration. 

It is worth noting that by virtue of the definition the definite integral 
remains unchanged when at any point c in [g, b] the value f(x) of the in- 
tegrand is replaced by some other number. In other words, if we replace 
the integrand f(x) by the function 


_ (Jœ) for xe[a, b], x zc, 
#6) = Р for х = c, 


where A з f(c), then 


b b 
| ЛТОдах = | #(х)ах. 


This is also true if f(x) is modified at any finite number of points in [a, 5]. 


The definition above applies only if а « b; it is also convenient to in- 
a 


clude the cases a = b and b « а. We put | То)уах-0 for b = a and 


a 


a b 
| Л) dx = - | feo dx for b < a. 
b a 

b 


Example. Compute | ах. 


а 
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ча By the definition we have 


b 
| dx = lim Y Ахк = lim 2 (хк — xk - 1) 


AO к=] 
= lim ба — Xo) + (GG — M1) +... + (Хя-1- Xn-2) 
+ (хл — Xn-1)] = lim (xn — х) = lim(b-a)=b-a > 
хо х—0 


Integrable functions. A function f(x) defined on the closed interval 
[a, b] is said to be integrable on [a, b] if there exists the definite integral 
b 


| Хо) dx on [a, Б]. 

Theorem 10.1. 7f the function f(x) is integrable on the closed interval 
[a, Б] then f(x) is bounded on [a, Б]. 
ч Let f(x) be unbounded on [a, b]. We divide [a, Б] into n subintervals 
[x - 1, х], К = 1, 2, ..., n. Since f(x) is unbounded on [a, Б] there exists 
a subinterval, say [xo, xi], where f(x) is unbounded. Consider the selection 
t, b, ..., & and the integral sum 


$ = 23 Sk) Ах = f(&) Ах + 22 SE Ах. 


Clearly, by a suitable choice of & in [xo, xi] where f(x) is unbounded it 

is possible to make the modulus |S,| of S, arbitrarily large provided that 

the points 5, &, ..., & are the same for all selections, ie, provided that 
n 


the sum У) Д(&) Axx remains the same for the chosen & in (Хо, x1]. This 
k-2 
implies that the integral sum 5, has no finite limit as max Axy > 0, ie., 


$ gn 
f(x) is not integrable on [a, Б]. Whence it follows that if f(x) is integrable 
on |а, Б] then f(x) is bounded on (4, b]. > 
Remark. If a function is bounded on [a, Б] it is not necessarily integra- 

ble on this interval; in other words, a function can be bounded on [a, 5] 
but not integrable on [a, b]. For example, the Dirichlet's function 

1 for rational x 

fœ) = ро) = ae 

O for irrational x 
is bounded on the closed interval (0, 1] since |f(x)| < 1 for all x in [0, 1]; 
however f(x) is not integrable on [0, 1]. 


44 Indeed, the integral sum Sr = p Лё) Axe becomes $,- 


2 1-Ах,- = Дх, = 1 for every ы п of rational points &, k = 1, 
k- 
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n 


2, ..., hn, and Sn = >) 0- Ax, = 0 for every selection of irrational points 
k=1 
£k. Hence, given any arbitrarily small А = (тах Ax, the integral sum Sn 
&K&n 


is equal either to 1 or to 0; this means that S, has no limit as А > 0, ie, 
f(x) is not integrable on (0, 1]. » 

We shall give without proof three theorems that outline sufficient condi- 
tions for a function to be integrable on a closed interval. 

Theorem 10.2. 7f the function f(x) is continuous on the closed interval 
la, b] then f(x) is integrable on [a, b]. 

For example, the function f(x) = е“ is continuous on (0, a] where a 
is an arbitrary number and, consequently, f(x) is integrable on (0, a], i.e., 

+ о 


there exists the definite integral | e dx of this function. 
a 

Theorem 10.3. 7f the function f(x) is defined and monotone on the 
closed interval [a, b] then f(x) is integrable on [a, Б]. 

It is worth noting that all values of the function f(x) which is monotone 
on [a, Б] lie between the numbers f(a) and f(b); so f(x) is bounded on [a, b]. 

Theorem 10.4. Let a function f(x) be bounded on a closed interval 
[a, b] and let f(x) have a finite number of discontinuities (of the first or 
second kind) on [a, b]. Then f(x) is integrable on [a, b]. 

For example, the function 


sin} for x #0 
70) = 1 for x = 0 


is integrable on the closed interval (0, 1] since |/(x)| <1 for all x in 
10, 1], i.e., f(x) is bounded on (0, 1], and f(x) has the only discontinuity 
(of the second kind) at x = 0. 


10.2 Properties of the Definite Integral 


We shall derive some properties of definite integrals assuming that 
all functions in question are continuous and, consequently, integrable on 
a closed interval [a, 5]. 

(1) The definite integral is dependent only on its lower and upper limits, 
ie, on the numbers a and b, and on the integrand f(x); however, it is in- 
dependent of the variable of integration. So the definite integral remains 
unchanged when x is replaced by any other symbol, i.e., 


b b b 
| /судх- | лда = | fa) dv. 


a a a 
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(2) The constant factor can be taken outside the integral sign so that 


b b 
| Af(xjdx =A | Sx) dx, A = const. 

ча By the definition of the definite integral we have 
b 


| A f(x) dx = lim Y A f(£k) Axx 
2 APO к=1 


n b 
= A lim 5; f(&) Ах = A | До) dx. P 
Х-0К-1 2 


(3) The definite integral of the sum (difference) of two functions is equal 
to the sum (difference) of the integrals of these functions, i.e., 


b b 
Lite) Aide = | fie) dx + | Љо) dx. 


Reese Fo WS 


(109 + i00] dx = lim 2 LG) + ААХ 


aif | Уг file) Ах + X еам | 
zl 


A720 | k=1 


= lim У) file) Axe + lim >; /(8) Ах 
Х-0Кк-1 Х-0Кк-1 


b b 
= | ло) dx | ^о) ах. » 


а а 


Corollary. The linearity property of the definite integral: 


b b b 
| [Ai G0 + A300] ах = Ав | ЛО) dx + Аз | Р(х) dx, 
where А4; and Ах are arbitrary constants. 
(4) Given any numbers a, b and c there holds 


b c b 
| Л) dx = | Л) dx + | Л) dx 


provided that both integrals on the right exist. 
This property is often called the interval union property. 
ча We distinguish two cases. 
(a) Let a « c « b. Then by the definition of the definite integral we have 


b n 
| f(x) dx = lim 21 f(&) Axe. 
i M70, 4 


10.2 Properties of the Definite Integral 463 


Since the definite integral is independent of the partition of (ад, Б] we 
can include the point c into the selection for the given partition of [a, 5] 
by choosing the partition (Fig. 10.2) as 


M HACK «x <... < Xn ^ CX Xn«1 ... < Xo = D. 


Then the integral sum У) f(&&) Axx associated with this partition of 
k=1 


[a, b] can be split into two integral sums as 
>, SE) Axe = У ДХ) Ах + DP) NEk) Ax, 
К-1 k=1 к=т+1 


where the sums on the right are associated with the partitions of [а, с] 
and of [c, 5], respectively. 


1 




















d M SN = 
0| щ=а х, т, Im Imai b-IQ х 


Fig. 10.2 


Evaluating the limit as \ = max Ax, > 0, we obtain 


<К<п 


b 


| Ло) dx = lim У) ДЕ) Axe = lim >, ftr) Axx 
Ч хә 0 k=l 20 К-1 


n с b 
+ іт У) Л) Ах = | Дх) dx + | Јо) dx. 
1 a с 


A7Ok=m+ 
(b) Now let a « b « c. As before we have 


c b c 
| Л) ах = | feodx + | Ло) dx. 


a a b 
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Whence 
b c c c b 
| fox) dx = | Дх) dx – | Дх) dx = | f(x) dx + | f(x) dx. 
a a b a с 
Given f(x) > 0 and a < с < b this property implies that the area of the 
curvilinear figure 2ABb is equal to the sum of the areas of the curvilinear 
figures acCA and cbBC as easily seen from Fig. 10.2 > 
(5) Let the functions f(x) and g(x) be such that f(x) < g(x) on [a, b]. 


Then 
b 


b 
| fe) dx « | во) dx. 


This means that integration preserves inequalities between functions. 
"4 Since f(x) < g(x) at every point x in [g, b] then given any partition 
of [a, 0] and any selection &, k = 1, 2, ..., n, there holds 


> Fx) Ах < È g(E) Axr. 


UN the йып а$ к= Шах Ахь- 0, we get for a<b 
<п 


| fade | | в0) ах > 


a a 








Fig. 10.3 


Remark. If f(x) 2 0 and g(x) > 0 on [0, b] this property means that 
the area of the curvilinear figure @bB,A, does not exceed the area of abB242 
(Fig. 10.3). In particular, this property implies that 


b 
| sonar > 0 (| оао) 


for f(x) 20 (f(x) < 0) on [g, b] where a < b. 
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(6) Let a « b. Then 
b 


| T(x) dx 


а 


b 


< | дах. 


а 








ча Evaluating the integral of 
-F| < Дх) < |/09| 
from a to b, we obtain 


b b b 
- | ldx < | feo dx « | fel dx 


a a a 


so that 


< f wide > 


a 


b 
| fle) dx 


a 








(7) Let m and M be the minimum and maximum values of f(x), respec- 
tively, on [g, b] where a < b. Then 


b 
m(b — а) < | f(x) dx « M(b — a). 


a 


9 Since m x f(x) < M for all x in [a, b] Property 5 yields 


b b b 
| тах « | дах < | Мах. 


Observe that 


b b 
| mdx =m | dx = m(b — a) 
and І i 
b b 
| Mdx = M | dx = M(b - а). 
Whence 


b 
m(b — a) > | ТОодах < M(b — а). > 
Remark. Given a function f(x) > 0 on [a, b] where a < b, this property 
implies that the area Q of curvilinear figure abBA lies between the areas 
О, and Q, of the rectangles abBiA; and abB,A; (Fig. 10.4), ie, 
Qı < Q S< Q. 
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Fig. 10.4 
Qn 
: ах 
Examples. (1) Evaluate the integral ——————-. 
y N10 + 6 sinx 
ча Since 
m= min r-—-———— —— = 0.25 
O<x<2rV10 + 6 sinx У10 + 6 sinx x-l 
and 
M = тах ! - = = 0.50. 
0<х<2тү10 + 6 sinx N10 + 6 sinx Em 
2 
Then Property 7 yields 
2a 
2т х 025 < | Rode. oxo 
i N10 + 6sinx : 


and 
2x 


T < | ___ах___ < т. > 
2 y У10 + 6sinx 
(2) Evaluate which of the integrals 
1 1 
| e^"dx and | e “ах 
0 0 


is larger without direct computations of their values. 
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я We have x? < x for all x in [0, 1]; whence —x € —x*. Since e > 1 then 
e'*«e' * and Property 5 yields 

1 1 

| e “dx> | e""dx. > 

0 0 


10.3 Fundamental Theorems for Definite Integrals 


Mean value theorem. We start with a theorem which states that 
every continuous function on a closed interval attains a specific value called 
the mean value of the function. 

Theorem 10.5 (mean value theorem). Let f(x) be a continuous function 
on a closed interval |a, b]. Then there exists at least one point Ё in 
[a, Б] such that 


b 
| Ло) dx = (b - af), a < E« b. 


a 


ча Since f(x) is continuous on [a, b] it has the minimum value m and 
the maximum value M on [a, 5] (see Fig. 10.4). Then Property 7 yields 


b 
m(b – а) < | Дх) dx € M(b – a). 


a 


Whence 


Дх) dx 


— 5 
b-a M 


Q < С 


т< 





because b — а > 0. 
Put 





b 
1 а 
as | Јо) dx = р, 
where m < y « M. 

Since f(x) is continuous on [a, 5], it attains all intermediate values be- 
tween т and M. This implies that there exists a point x = £ in [а, b] such 
that ДЕ) = p, i.e., 

b 

| f(x) dx b 

EX TTE ng o | Јо) dx = (b - aff), a&£sb. > 
a 


30* 
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Remark. For a « b we have 








a«t«be0«t-a«b-ae0« = <1. 
Put 

саг. -0,0«0«1. 

b-a 


Then £ = a + (b — a)0 and we rewrite the conclusion of this theorem as 


b 
| Jœ) dx = (b— a)f [a + (b – 0)0], 0<6<1. 


а 


y=f(x} 





fle) 




















Fig. 10.5 


To interpret the mean value theorem for integrals we look at Fig. 10.5. 
The curvilinear figure abBA is bounded by the graph AB of the function 
f(x) which is nonnegative on [a, Б] (a < Б), the x-axis and the vertical lines 

b 


x = a and x = b. We let О, denote the area of abBA; then | feo ах = О. 


For the rectangle abNM the base is [a, b] and the height is equal to the 
ordinate of the point C(£, f(£)) so the area О» of abNM is О» = (b — a)f(é). 
The mean value theorem tells us that there exists a point C(£, f(&)) on the 
graph AB such that О, = О». 
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The number 


1 
b-a 





MO) = 


Ces Су 


S(x)dx 


is called the mean value of the function f(x) on [a, b]. 

If a function f(x) is continuous on [a, Б] then there exists a point £ 
in [g, b] such that M[f(x)] = f(£). 

Example. Compute the mean value of f(x) — sinx on [0, 7]. 
«4 By virtue of the above definition we have 


M{[sin x] = — | sin xdx = + (—cos + cos0) - 2. » 
0 





T 


We have applied here the Newton-Leibniz theorem which will be derived 
in this section. 

Fundamental theorems of calculus. Let f(x) be a continuous function 
on [a, b]. We choose an arbitrary point in [а, b] and consider the definite 


b 
integral | f(x) dx. Since the definite integral is independent of х we can 
a 


substitute ¢ for x and write 
x x 


| foo dx = | F(t) at. 


а а 


Since f(x) is continuous on [а, Б], this integral exists at every x in [a, b]. 
Consequently, the integral becomes a function of its upper limit x. We let 
F(x) denote this function, i.e., 

x 


F(x) = | S(t) dt. 


а 
Theorem 10.6 (first fundamental theorem of calculus). Let f(x) be con- 
x 


tinuous on [a, b]. Then the function F(x) = | S(t)dt has a derivative at 
a 
every point x in [a, b] and F’ (x) = f(x). 

In other words, the derivative of the definite integral with respect to 
its upper limit is equal to the value of the integrand at the upper limit 
of the integral. 

“4 Consider the increment Ax з 0 such that x + Ax € [a, b]. This incre- 
ment gives the increment AF in F(x). By virtue of the interval union 
property (Property 4) of the definite integral we have 
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x+Ar x x+ Ах 


AF = F(x + Ax) - F(x) = | Адаг – | Ада = | F(t) dt 
a a : Tes E 
+ | S@ dt = | S(t) at + | л) а= | F(t) at. 


Applying the mean value theorem, we obtain 


AF = (x + Ax — X)f(x + 0Ax) = Ax f(x 8 Ax). 
Whence 


A = fx + 0 Ax), 0<0<1. 


Observe that f(x) is continuous at every point х in [a, b].. Then evaluat- 
ing the limit of the above ratio as Ax > 0, we get 


lim a = lim f(x + 8 Ax) = f(x), 
Ax Ах-0 


Ах-0 
that is, 


F' (x) = f(x) ог (i So a) = f(x) for all x in [g b]. > 


Remark. If f(x) is a continuous function on [a, Б] then for every x in 
|а, b] there holds 


b , x , x , 
(1л04) -(-1лоа) --(1лоа) = ло. 
x b 


b 


x , 0 , 
For example, (| "^ar = +e” and (| е-“ш) = e^, 
х 


0 


Theorem 10.7 (second fundamental theorem of calculus). Le! f(x) be 
a continuous function on a closed interval [a, b]. Then f(x) has an an- 
tiderivative on [a, b] and, consequently, f(x) has the indefinite integral. 
-4 Since f(x) is continuous on [а, b] there. exists the definite integral 
x 


x 
| f(t) dt for every x in [a, Б], i.e., there exists the function F(x) = | лра 
а а 
such that Е’ (x) = f(x) for all x in [g, b]. This means that F(x) is ап an- 
tiderivative of f(x) on [a, b]. Whence it follows that the indefinite integral 
of the function f(x) continuous on [a, b] admits a representation of the 
form 


x 


[409 dx — | f(t) dt + С, 


a 


where C is an arbitrary constant. M 
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Newton-Leibniz theorem. We now turn our attention to a theorem 
which is helpful in computations of definite integrals. 

Theorem 10.8 (Newton-Leibniz theorem). Let f(x) be a continuous func- 
tion on a closed interval [a, b] and let F(x) be an antiderivative of f(x) 
on [a, b]. Then 

b 
| f(x) dx = F(b) – F(a). 


a 


ч Consider the function 
x 
Ф(х) = | S(O dt, x € [a, b]. 
This function is an antiderivative of f(x) on (4, b]. Recall that any two 
antiderivatives of a given function differ from each other by a constant; 
hence there exists a constant C such that 
x 
Ф(х) = F(x) + C or | S(t) dt = F(x) + C 
a 


for all x in [a, b]. 


а 


Put x = а. Then | f(t) dt = F(a) + C. Since | Л) dt = 0, we have 


F(a) + C = 0; whence C = -Е(а). Thus we arrive at 
| Л) dt = F(x) - F(a). 


a 
b 


Put now x = b. Then | S(t) dt = F(b) — F(a) and, substituting x for 
t, we arrive at the desired formula 


b 
| Лх) dx = F(b) - Fla). > 


b 
Remark. Using the notation F(b) — F(a) = F(x) | , we can write the 


Newton-Leibniz theorem as 
b b 
| fo) dx = FG) |, 
a 
where f(x) = F’ (x). 
The Newton-Leibniz theorem establishes a close relation between the 
definite integral of f(x) and its antiderivative F(x) so that the computation 
of the former becomes the evaluation of the latter. 


472 10. Integral Calculus. The Definite Integral 


Examples. Compute the following definite integrals. 


4 
(1) | x dx. 
3 


ча Recall that 








2 2 
| хак е 5 + C and FQ) = 2-4 C. 
Then 
4 
2 4 2 2 
4 2 
25 = = 6. > 
| xdx 2, 5 7 6 
2 
(2) | sinx dx. 
0 
ча We have 
| sinx dx = —cosx = —cos mr — (—cos 0) = 2. » 
0 





0 


10.4 Evaluating Definite Integrals 


Integration by substitution. Now we consider how a method of in- 
tegration by substitution applies when we deal with definite integrals. 


b 
Theorem 10.9. Let there be given the integral | Ах) dx where f(x) is 


a 

а continuous function on a closed interval [a, b] and let x = e(t). Suppose 
that the function e(t) satisfies the following conditions: 

(i) the function e(t) assumes values from a to b when t varies from 
a to B so that ф(а) = a, e(8) = b and all intermediate values of e(t) are 
in [a, b]; : 

(il) the derivative e' (t) of e(t) is a continuous function on the closed 
interval [a, 8]. 

Then 


b 8 
| f(x) dx - | Ле( їе (0) at. 

ча Applying the Newton-Leibniz theorem, we have 
b 


| S00 dx = F(b) – F(a), 


a 
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where F(x) is an antiderivative of f(x) on [a, b], i.e., F'(x) = f(x) for all 
x in [a b]. 

Consider the composite function Ф(/) = F [e(t)] in / on [a, 81. By the 
chain rule of differentiation of a composite function we obtain 


$'(r) = F" [e(0] e'() =F Oe 0). 


Hence Ф(/) is an antiderivative of the function /1е(0| e' (7) continuous 
on [a, 8]. Then the Newton-Leibniz theorem yields the desired result, 
namely 


В 
| fle(01 e' (0 dt = &(8) – Ф(о) 


a 


b 
= Fle(8)) — Fle(a)] = F(b) - Fa) = | Ло) ах. > 


Remark. This method of integration is appropriate only if by a suitable 
8 


choice of y(t) the integral | fle(0] e'(t) dt decomes easier to compute 


а 
b 


than the original integral | Ах) dx. 


a 
Notice that the method does not require the original variable be sub- 
stituted for the new one. 
We shall illustrate the integration by substitution by means of examples. 
Examples. Integrate by substitution the following integrals. 


(1) | Ма? — x? dx (a > 0). 


0 


4 Put х= asint. Then dx = acostdt and Ма? — х? = a sint. Sub- 
stituting x = 0 and x = a into x = a sin /, we get two equations а sint = 0 
and a sin / = a; whence / = 0 and ¢ = 7/2 are the lower and upper limits, 
respectively, of the new integral. Finally we have 


4 2 2 
| а? – ж? ах = а | cos? tdt = а? | Pa 
0 0 0 
2 2 3 
C | bus] ) =. >» 
2 0 2 0 4 
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4 Putx-e' Since f= 0 for х = 1, f= 1 for x =e and t = Inx, 


e 1 
2 3 1 
| inc | 22-4 1 p> 
0 





x 3 вэ 





0 


It is worth noting that sometimes it is convenient to use the substitution 
t = (х) instead of x = e(t). 


In2 
(3) | Ne* — 1 ах. 


0 








2t dt : 
4 Put = Уе" — 1 . Then x = In ( + 1) and dt = pro MES 
for x = 0 and £ = 1 for x = [n2, respectively, we have 
In2 1 Ё 1 1 
ест goss | cd р Чексе, 
1-1 1-1 
0 0 0 
1 
1 1 1 
-2| ((-ттг)а-2(‹ — tan“! t ) 
l+t 0 0 
0 








220 -tan^!)-2-I. » 


2 
1 
(4) | (2x? — 1)Vx4 — 2x +1 dx. 


0 


ч Put: = x? — 2x + 1. Notice that here we do not need to find the func- 
tion х= (t) On differentiating = x*—2vx+1, we have 


dt = (Ax? — 2)dx; whence (2x? - 1) dx = lat and 


1 0 


| e?- уу эхэл dx = 5 | а= у. » 
0 


1 


The following theorem is sometimes useful in simplifying the computa- 
tions of definite integrals. 

Theorem 10.10. Let f(x) be an integrable function on a closed interval 
[—а, a] which is symmetric relative to the point 0, where a > 0. Then 


a 


n 2 | Лх) ах if f(x) is even, 
| fo)dx-2| ? 7 : 
2 0 if f(x) is odd. 
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я By the interval union property (Property 4) of the definite integral 


we have 
a 0 


| Jœ) dx = | fl) dx + | fle) dx. 
-а -a 0 

Put x = -t so that dx = —dt and t = —x. Then 
0 0 a a 

| Дх) dx = - 0-04 = 5-24 = | x dx. 


-а а 0 0 
Непсе 


а а 


| /одах= | 0-х) + foil dx. 
E! i 


Recall that for an even function f(x) we have f(x) = f(— x) so that 


| Лх) dx =2 | fo) dx. Similarly, for an odd function f(x), ie, for 
a 0 


А-х) = —Дх), | Дх) ах = 0. > 


For example, | sin? х ё ах = 0 since the integrand is an odd func- 
- я 


tion on the closed interval [— т, 7]. Indeed, 


cos(—x) — sin? x e", vxe[—m, m]. 


sin? (- x)e 
Integration by parts. We shall extend the method of integration by parts 
examined in the previous chapter to definite integrals. 
Theorem 10.11. Let the functions и = u(x) and v = v(x) have continu- 
ous derivatives u' (x) and v'(x) on a closed interval [a, b]. Then 
b b 
| u dv — uv s | v du. 


a a 
ча Ву the hypothesis uv = u(x) v(x) is a differentiable function on [a, 5]. 


Then (uv)' = uv’ + vu’, ie, uv is an antiderivative of the function 
uv’ + vu’ on (0, b]. Applying the Newton-Leibniz theorem, we obtain 


b b b 
b ГА 
| (uv' + ои’) ах=и | and | uv’ dx + | vu’ dx=uv 
a 
a a 


Whence 


b b 
| uv! dx = uv |? – | vu’ dx. 


а а 


476 10. Integral Calculus. The Definite Integral 


By the definition of the differential we have v’ dx = dv and u’ dx = du. 
Substituting these relations into the above identity, we obtain the desired 
result 


b b 
udv = uv |? — f vdu. >» 
a 
a 


а. 


Examples. Apply integration by parts to the following integrals. 
(1) | (т — x)sin x dx. 
0 


ч We have udv = (т — x) sinx dx. Put u = т — x; then dv = sin x dx, 




















du = —dx and v = —cos x. Integration by parts yields 
| (т — x)sinx dx = – (т — х) соѕх |" — | cos х dx 
0 0 
= (x — T)cosx "im sinx = T. > 
e 
1 
(2) | 1X dx. 
х 
1 
4 4. : 
ч Put u = In x and dy = Z. Тһеп du =~ and v= 234 and in- 
X x x 
tegration by parts gives 
e e 
I e а е е 
| ПХ dx = -Linx ТЕБЕ am 
x х А х E * [+ 











1 1 


mb od quce = 
е е 


10.5 Computing Areas and Volumes by Integration 


Areas of plane figures in Cartesian coordinates. Let f(x) be a 
continuous nonnegative function on a closed interval (8, b] where a < b. 
Then the area Q of the curvilinear figure abBA (see Fig. 10.6) is 


b 
Q= | f(x) dx. 


a 


For example, if a plane figure is bounded by the parabola y = x’, the 
vertical line x = a (a > 0) and the x-axis as shown in Fig. 10.7 then its 
area Q is 


a a 


Зан 


3 
0- [22-4 


0 
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Fig. 10.6 


Suppose that f(x) is a negative function on the closed interval [a, 5], 
where a < b, ie, f(x) < 0 on [a, b]. Then the region (the plane figure) 
bounded by the graph of y = f(x), the lines x = a and x = b and the x-axis 

b 


lies below the x-axis (see Fig. 10.8) and | f(x) dx < 0. In this case the area 


Q of the plane figure abBA is 
b 


Q= - | ахо Q = 


а 


b 
| Ло) ах 








Example. Compute the area of the plane figure bounded by the parabo- 
la y = x? — 2x and the x-axis (Fig. 10.9). 








Fig. 10.7 Fig. 10.8 
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ч Since y < 0 on [0, 2] the plane figure lies below the x-axis, the desired 


area is 
2 


0 


2 


х 
" 3 


w| 








2 2 
Q=- | œ- mwa- | а-а 
0 0 


» 


[IE 


and О- 





Fig. 10.9 


Let a function f(x) change its sign when x passes through a point 
c € (a, b). Then the plane figure bounded by the graph of y = f(x), the 
x-axis and the lines x = a and x = b can be regarded as being made up 
of two plane figures lying above and below the x-axis (Fig. 10.10). In this 
case the area Q of the plane figure is 
b 


| f(x) ах 


с 








О= 0+0 = | ло) ах + 
For example, for the plane figure bounded by the parabola у = 1 — x?, 
the line x = 2 and the x- and y-axes (Fig. 10.11) is 
2 


| (1 – x)ax 
1 








1 
О = | (l — х2) ах + 
0 


2 2 


I 


1 
+ 
0 


1 


х х 
ES к "a 




















1 


— 


2542-3 | =2 


w| оо 





3 
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Fig. 10.10 
Fig. 10.11 


Suppose now that f(x) and g(x) are continuous functions on [a, 5], 
where a « b, and f(x) > g(x) > 0. Let the graphs of y = f(x) and y = g(x) 
intersect at the points A and B (Fig. 10.12). Then the area Q of the plane 
figure bounded by the graphs of y — f(x) and y — g(x) is equal to the differ- 
ence of the area Qi of the plane figure aACBb and the area Q» of the 
plane figure aADBb. Thus 

b b b 
Q= | бдах- | g@xdx or Q= | [/х)— 001 dx. 


а а а 











Fig. 10.12 
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To find the limits a and b of integration one must eliminate y from 
the system of equations y + f(x) and y = g(x) and solve the equation 
Ах) = р(х), whose real roots will be the limits sought for. 

Example. Compute the area of the plane figure bounded by the parabo- 
las y = 4x — x? and y = x? — 4x + 6 (Fig. 10.13). 





y=x?-42+6 








Fig. 10.13 


ча The abscissas of the points A and B where the parabolas intersect are 
the solutions of the equation 4x — x? = x? - 4x + 6 or х? - 4x +3=0 


whose roots are xı = 1 and x? = 3; hence the lower and upper limits of 
integration are a — 1 and b — 3. Then the desired area Q is 


3 3 


Q- | [4x — x? - (х? - 4x + 6] dx = | (8x — 2x* — 6) dx 


1 1 


3 3 
— 6x 


1 


= 4x? тый 


x 
i o3 





ә 





1 





Let a curve АВ be given by parametric equations x = (t) and у = y(t) 
where (t) and (f) are continuous functions. Suppose that the function 
v (t) has a continuous derivative e'(f) on the closed interval [o, 8] and 


Ф(а) = а and e(8) = b (Fig. 10.14). Evidently, the area О of the planc 
figure abBA is 


b 
Q= | уах 


а 
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If we put x = e(t) and y = y(t) then we get the following expression for 
the area Q of the plane figure specified by parametric equations 
8 


Q= | We Made. 





Fig. 10.14 


Example. Compute the area of the ellipse given by the parametric equa- 
tions x = a cost and y = bsint, 0 < £ < 2r (a, Б> 0). 
< Since the ellipse is symmetric relative to the x-axis and to the y-axis 
it suffices to compute the area of the ellipse in the first quadrant; the 
desired area is 
a 
Q=4 | y dx. 
0 
Put x = a cost and y = b sint so that dx = —a sin tdt. To find the 
new lower and upper limits of integration we set x = 0 and x =a in 
x=a cost; this gives a cos t = 0 and а-а cos t. Whence f; = a = 3/2 and 
b= = 0. Therefore if x varies from 0 to a t changes from т/2 to 0 so that 








0 0 
0-4 | b sint(—a sin t dt) = —4ab | sin? t dt 
7/2 т/2 
"e 122: 72 
= 4ab | 1980 at = zab (i шэн ТРИ ) = хав» 
2 0 2 0 
0 


Sometimes it is convenient to compute areas of plane figures by apply- 
ing formulas that involve integration with respect to the variable y. In this 
case the variable x is regarded as a function in y, ie, x = g(y) where g(y) 
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is a single-valued function continuous on the closed interval [c, d] of the 
y-axis. The limits c and d of integration with respect to y that are the points 
of intersection of a given curve with the y-axis can be computed as the 
roots of the equation g(y) = 0 obtained by putting x = 0 in the equation 
х = в(у). Then the area О of the plane figure bounded by the graph of 
the curve x = g(y) and the y-axis (Fig. 10.15) is given by the formula 


d 
Q= | gi)dy. 


Example. Compute the area of the plane figure bounded by the parabo- 
la x = 2 — y — y? and the y-axis (Fig. 10.16). 





Fig. 10.15 Fig. 10.16 


~ The limits of integration are the points of intersection of the parabola 
with the y-axis so that putting x = 0, we get the equation 2 — y — y? = 0; 


whence yı = c = —2 and у = d = 1. The desired area О is 
1 1 1 
= 4.5. 


-2 


-2 


x 
2 3 











1 
Q- | Q - у - у?) ау = 2y 
—2 


-2 


Problems. (1) Compute the area of the plane figure bounded by the 
parabola y? = 2х + 1 and the line x — у — 1 = 0; 

(2) Compute the area of the plane figure bounded by the curves 
y = sin ^! x and у = cos ^ ! x and the x-axis. (Hint. Write the equations of 
the curve in the form x = g(y)) ! 
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Area of a plane figure in polar coordinates. Suppose that a curve is 
specified by its polar equation о = f(ẹ) where /(0) is a continuous non- 
negative function on a closed interval [o, 8]. The plane figure bounded 
by this curve and the two rays that start at the pole and make the angles 
a and 8, respectively, with the polar axis is called the curvilinear sector 
(Fig. 10.17). 


p-f(v) 


шээг 


Fig. 10.17 





Fig. 10.18 


To compute the area of the curvilinear sector OA BO we divide the sector 
into n subsectors by drawing л rays р =a = фо; $= фі, ..., 9n- n5 
Фф = В = фп from the pole. Let Agi, Agz, ..., Доп be angles between the 
rays. We let gx denote the angle formed by the rays ёк-: and «x and бк 
the position vector associated with фк. Consider the circular sector of 
radius дк with central angle Ag, (Fig. 10.18). The area of this circular sector 
Я 1. = A 2 
is equal to AQ. = 5 Ox Age ог АОк = 3 “Т.«Фодок since ёк = f(ex). 

When we replace each curvilinear subsector by the corresponding circu- 
lar sector we obtain the plane figure made up of n circular sectors; its area 


Qn is 


31* 
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n 


Q.- Ўр ло, -1 2, f GO dex. 
kzl 


k=1 
We denote the largest Лук by А = (тах Аек. If we make л tend to in- 
&SK&n 


finity, so that А > 0, i.e., if we divide a given curvilinear sector into smaller 
subsectors, then the plane figure made up of circular sectors comes closer 
to the curvilinear sector OABO. Thus we may regard the limit of the area 
Qn as X= пах Аек э 0 as the area of the curvilinear sector OABO 


provided that this limit exists and is independent of a partition of the closed 
interval [a, В] and of a selection Фк associated with a given partition, К = 1, 
25.4, L6, 
n 
: : 1 2 
Q = lim Q, = lim У) 2l (x) Аек. 
470 470 
к-1 
n 


Clearly, the sum 2 lf(G0 Дек is an integral sum for the function 
k=l 


FP) which is continuous on [a, В] since f(¢) is continuous on [a, 6]. 


Hence, the limit of this sum as А > 0 exists and is equal to the definite 
8 


integral | 1 fle) de. 





Fig. 10.19 


Therefore the area of the curvilinear sector OABO is given by 
8 8 
1 1 
go | F ie) de or Q => | ody. 


Example. Compute the area of the plane figure bounded by the cardioid 
о = a(1 + cos 9), a > 0 (Fig. 10.19). 
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-4 The desired area is 


g 2r 22 т 
got | (1 + cos ede = -> | (1 + 2 cos e + cos? e) de 
0 0 
2x 
2 1 2 
-5 | (1+ 200+ 29820. dp 


a? 3 1 3 2 
= mn 222 = — 5 р 
| С +2 с05 Ф + 5 cos 2%) de 7 та 


Volumes of solids. We consider a solid bounded by a closed surface. 
Let Q be the area of the cross section of the solid by a plane perpendicular 
to the x-axis (Fig. 10.20). Clearly the value of Q is dependent on the loca- 
tion of the perpendicular plane relative to the x-axis, i.e., this value is a 

















Fig. 10.20 


function in x so that Q — Q(x). We assume that Q(x) is a continuous func- 
tion on a closed interval [а, b]. To set up the formula for the volume of 
the given solid we draw л planes x = a = Xo; x =X, Xo, ..., Xn- i15 
X = b = x, perpendicular to the x-axis. These planes divide the solid into 
п slices and the closed interval [a, b] into n subintervals [xx - 1, хк], k = 1, 
2. Su Ph 

To approximate the volume of the kth slice we select an arbitrary point 
& in [xx - 1, хк] and consider the cylinder whose generating line is parallel 
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to the x-axis and directing line is the line of intersection of the solid surface 
with the perpendicular plane x = £x (see Fig. 10.20). The volume AV, of 
this cylinder is the product of its height Ax, and the area Q(£x) of its base 
so that AV, = Q(x) Axx, where £c€[xc-1, хк]. The volume of all n 
cylinders thus obtained is 


Vn = Dy AV, = x Q(&) Ах. 
k=] К-1 


If the limit of the sum on the right exists as ^ — (тах Ax, э 0 we set 
Хєл 


it equal to the volume of the solid in question so that 


у= lim Y; Q(&) Ax. 
A20kz-I 


Notice that this sum is an integral sum for the function Q(x) which is con- 
tinuous on the closed interval [g, b]; hence, the above limit exists and is 
equal to the definite integral 


b 
V= | Q) ах. (ж) 


а 





Fig. 10.21 


Example. Compute the area of the solid bounded by the ellipsoid 
2 2 
х RU pt qu 
a b c 
ча The cross section of the ellipsoid with the plane perpendicular to the 
x-axis at x is the ellipse (Fig. 10.21) 
2 2 2 
EE ырыл. 
b с а 
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or 


2 
y z 


+ 
х \2 x? \2 
(è 1- =) (: =) 


with the semiaxes 


x x 
b jl — — апа с 1---- 
А| a? a? 


Then the area of the cross section is 


Q(x) = rbe (1 = х). 








Ht 

















Fig. 10.22 Fig. 10.23 


Applying (ж), we obtain 


а 


2 а 
= | rbe(1 ~ 5) ax nee | (: - 5) ax = $ паво 
a a 3 
0 


-a 
When b = c = a the ellipsoid becomes the sphere х? + у? + 22 = а? 
whose volume is equal to V = i та. » 


Now we consider a solid generated by revolving the curvilinear plane 
figure abBA around the x-axis (Fig. 10.22). The plane figure is bounded 
by the graph of the curve y = f(x), the lines x = a and x = b where a < b 
and the x-axis. The solid thus obtained is called the solid of revolution. 
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The cross section of the solid of revolution with the plane perpendicular 
to the x-axis at x is the circle whose area is Q(x) = «y^ = vf? (x); hence, 
the volume of the solid of revolution is 


b b 
V= т | Pœ) ах or И= т {| у? dx 


Example. Compute the volume of the solid of revolution generated by 
revolving the arc OA of the parabola y? = 2px around the x-axis 
(Fig. 10.23). 
ча The arc OA is specified by the equation y = V2px where p > 0; 
hence, the desired volume is 

Ver | ydx=n | 2px dx = пра?. > 
0 0 


10.6 Computing Arc Lengths by Integration 


Arc length. We consider the arc ÀB with the endpoints A and B 
(Fig. 10.24). Let Mi, №, ..., Мһ-\ be points оп AB If we join these 





Fig. 10.24 


points by line segments AMi, М.М, ..., M,- 1B with lengths Ая, As, 
..., Аѕп, respectively, we obtain the polygonal line AMiM, ... Mn-1B 
inscribed in the arc АВ. The length S, of this polygonal line is 

n 


S, = As; + A +... + Atm = У Ask. 
k=1 


Definition. The length S of the arc AB is the limit of the length S, of 
the polygonal line inscribed in AB as the largest of the line segments of 
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this polygonal line tends to zero, i.e, 
n 


S= lim S= lim У As 
maxAsy 0 тах 4540 к _ | 

provided that this limit exists and is independent of choices of the points 

Mi, М, ..., M4. | on the arc AB. 

Arc length of a curve in Cartesian coordinates. Let an arc AB be speci- 
fied by the equation y = f(x) where the function f(x) has a continuous 
derivative f' (x) on a closed interval [а, b]. Consider a partition of [a, b] 
into л subintervals [xx —1, xx], k = 1, 2, ..., n by choosing points such that 


X9y—a«xp«x«... «Xx 1«« Xi <... <= Б. 


y 

















— 
[6 Ig-a Ik Xk б=ху r 


Fig. 10.25 


Let (хк, (хк) be a point on AB corresponding to the kth subdivision point 
хк of the polygonal line. Then the polygonal line inscribed in AB is given 
by the points 


A = Moo, f(xo), Mi, Sa), ..., B= Mas, fn). 


We let As; denote the length of the kth line segment of this polygonal line 
and Ax, = Xk — хк-1 and Ayk = f(xx) — Д(хк-1), К = 1, 2, ..., n. Then 
the length of the Ath line segment (Fig. 10.25) is 


Азу = N (Axx)? + (Ax) = fl + (2 ) Ах. 





Ахк 
Applying the mean value theorem for derivatives we obtain 


Дук = Дк) — fox -1) = Gk — xx- 1) f" (&) = F’ (E) Ахк, 
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where & is a point іп [хк-1, xx]; whence 
Ask = V1 + [/'( О]. Ах 


so that the length of the polygonal line is 


$- УАЙ r Or Axe. (9) 
к= 1 


n 


Recall that f'(x) is continuous on [ag, b]. This implies that 


V1+ [/' Од] is a continuous function on (4, Б]. Hence the integral sum 
on the right of (*) has a limit S as max Asx > 0 so that 


n 


n b 
S= lim 5) М1 + [P (OP Axe = | Vi + 0 GO" dx 
k=1 a 


maxAsy 0, — 
Ixk&n 
or, using the abbreviated notation, 
b 
S= [v1 +(у,)* dx (жж) 
а 
Example Compute the arc length of the cafenary у = 


5 ын — cosh x bounded by the points А(0, 1) and B(a, cosha) 


(Fig. 10.26). 
ч On differentiating the equation of the catenary we have 








4 


y’ = sinhx. 
Noting that cosh? x — sinh? x = 1, we obtain 


Мі + Q'Y. = УІ + sinàx = Усоѕћ2 х = cosh x (cosh х > 0) 


so that 





a a 
S= | cosh x dx = sinhx| =sinha. » 
0 0 


Arc length of a curve specified by parametric equations. Let AB be an 
arc of a curve specified by the parametric equations 


х= e(t у = (0), to &t«T. 


We also assume that the functions (t) and y(t) have continuous deriva- 
tives e' (f) and ү’ (t) on [f%, T] and e' (f) = О on [fo, Т]. The original para- 
metric equations specify the function y — f(x) having a continuous 

E at 
derivative yy = T on [fo, T]. Then 


NT 4012) dx =V Te" (OY? + V (P? dt 


10.6 Computing Arc Lengths by Integration 491 





and formula (жж) yields 


T 
S= | Уе ОР + WCF dr 


fo 
or 


T 
S= | N (xc Y. + OY dt. (ж+ж) 


Examples. (1) Compute the length of the circumference of the circle 
with radius R (Fig. 10.27). 








Fig. 10.26 Fig. 10.27 


ча The circle with radius R is given by the parametric equations 
x= R cost, y=Rsint, Ox t < 2s. 


Then applying formula (***) we obtain 
2x 2r 


S= ( V(-R sin £}? + (R cost dt = R | dt -2xR. Ф 


0 0 
(2) Find the length of the ellipse given by the parametric equations 
x=acost, у= bsint, 0<{<2т (0«b«a). 


ча Sincex/ = -a sint and y; = b cost and the ellipse is symmetric rela- 
tive to the coordinate axes, formula (***) gives 
31/2 
S=4 | Ма? sin? t + b? cos? t dt 
0 
12/2 
=4 | Na?(1 — cos? t) + b? cos? t dt 


0 
1/2 31/2 
=4 | Ма? — (a? — b?) cos? t dt = 4a | М1-61совг dt, 


0 0 
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а? —b? 
where € = m is the eccentricity of the ellipse, 0 < € < 1. 


The definite integral on the right is called the elliptic integral; it is worth 
noting here that the Newton-Leibniz theorem is inapplicable for computing 
this integral since the antiderivative is not an elementary function. №» 


Notice that the substitution / -5 — 7 yields 
3/2 31/2 1/2 
| V1 7 е? cost dt = | М1 – е? яа т dr = | Vi- e sin’t dt. 
0 0 0 


This is the notation mostly used for the elliptic integral. 








Fig. 10.28 


(3) Find the length of one arc of the cycloid specified by the parametric 
equations (Fig. 10.28) 


x=a(t—sint), y=a(l— cost), Ox t < 2r (a> 0). 


ч Applying formula (*««), we have 





2x 2x 
S=a | N (1 — cost + sin? t dt = a | v2 — 2cost dt 
0 
2T 2r = 
Е least az t 
=a | 4зиё dto 2а | sin а 
0 0 
2x 
21 t 2* 
= 2a sinz dt = —4a cos- = 8a. > 
2 Pa 





0 


Arc length of a curve specified by its polar equation. Let AB be an 
arc of a curve specified by its polar equation о = До). оп a closed interval 
[a, 8] where f(y) has a continuous derivative f’(y). To find the length 
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of the arc АВ we derive the parametric equations of the curve. Recall that 
the relations between the polar and Cartesian coordinates are given by 
X = о cos e and y = o sin e. Replacing о by f(o), we get the parametric 
equations x = f(y) cos e and y = f(y) sin e that describe the given curve. 
We use here the polar angle р as the parameter. 

On differentiating the parametric equations, we have 


x, = (ф)со8Ф- f(g) sine 
and 
Yo = /' (её) $їп e + Дф) cos e. 
Squaring both identities and adding (x;)? with (y;)^, we get 
(x + (Y = LG) + DG». 
Then formula (***) gives 
R 2 2 
S= | МӨӨГ + LY de 
or, equivalently, 
B 
S= | Мо? + (0') de. 
Example. Find the length of the cardioid given by the polar equation 
о = а(1 + cos д), a > 0 (see Fig. 10.19). 


ча On differentiating the original equation we obtain о’ = —a sine = 
/'(ф), 0 € v € 2v, so that by symmetry 


х 


5-2 | Ма (1 + cos oy, + a’ sin? e de 


У2 (1 + cos e) de = 4a | [cos £ de 
0 


Now we consider an arc of the curve y = f(x) shown in Fig. 10.29. Sup- 
pose that the function f(x) has a continuous derivative f’ (х) on the closed 
interval [a, b]. We let AM denote the arc bounded by the points A(a, f(a)) 
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and M(x, f(x)). If we think of A as a fixed reference point and of M as 
a point that can move along the curve the length of the arc AM becomes 
a function in x. This length S is expressed as 


5- | VI + у Qo? ах = Ї V1+ UP dt. 


Since the integrand on the right is a continuous function on [a, b] we 


can write 
x 


450) _ d | 1 4 OF а) = X14 [P G0 


' dx dx 


or 


dS dy V 
we cut (2) | 
Whence the length dS of the arc AM becomes 


dS= |1 + (2) dx ог dS = N (dx? + (ау) . 


dx 






P N(x + пх, y*dy) 





Fig. 10.29 


To interpret the above formulas geometrically we look at Fig. 10.29. 
Clearly, the value of dS is equal to the length of the line segment MN 
of the tangent MT between the points M(x, y) and N(x + dx, y + dy). 

For sufficiently small dx = Ax the length AS of the arc ММ” of the 
curve y = f(x) corresponding to the increment Ax = dx can be thought of 
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as being nearly equal to the length of the line segment MN of the tangent 
MT to the given curve at the point M, ie, AS = dS. 
When a curve is specified by the parametric equations 


x= e(t), y = V, b St <T, 

where (г) and y(f) have continuous derivatives on [f, Т], then 
dS = N[e' (Y. + [v (P. dt 

Or 


dS = N (x y + OY y dt. 


If we put the parameter f equal to the length S of a variable arc, i.e., 
X = (S) and y = v(s), then the above formulas become 


dx M dy MV 
(as) + Gs) = 


For a curve given by the polar equation o = Де), a € ẹ < В, where 
f(e) is a function having a continuous derivative oh the closed interval 
[a, 8] we have 


45 = Мо? + (o'Y de. 
10.7 Applications of the Definite Integral 


Work done by a variable force. Suppose we wish to define a work 
done by a force F in moving a particle M along the x-axis from point a 
to point 5 (а « b). Recall from the course of general physics that the work 
W done by the constant force F is the product of the absolute value F 
of this constant force by the distance S = b — a, ie, W = FS, provided 
that F is directed along the x-axis. 

Let a force F acting on a particle M along the x-axis be a continuous 
function in x (Е = F(x)) on the closed interval [a, b] of the x-axis. We parti- 
tion the closed interval [а, b] into n subintervals with lengths Axı, Ax», 
..., АХд by choosing points in [g, b] such that x =a <x <x% <... < 
Xn = b. Assume that £ is an arbitrary point in the subinterval [xk 1, хк] 
and the force Е is constant оп [хк 1, xx] so that Е = F(£x). Then given 
a sufficiently small Ax, the work AW; done by Е (£x) in moving the particle 
from хк: to xx is AW, = F (tx) Axx and the sum 

n n 


Ил = >, AW, = >) F(&) Axe 
k=1 к-1 


approximates the work W done by the force F on the closed interval 
[а, b]. Since И» is the integral sum for the function F(x) on [a, Б] the 
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limit of W, as max Ax, > 0 defines the work W done by the force 
х gn 


F on [a, 5] (this limit exists, for Р(х) is a continuous function on [a, 5]). 
Thus the desired work W should be 


n b 
W= lim У Е) Ах = | F(x) dx. 


' max Ахк?0 gly 
l<kgn a 


Example. Let two point charges qı and qz be located at the points Mo 
and Mi, respectively, and let г be the distance between Мо and Mi. Define 
the work W done in moving the charge qz from Mi; to М assuming that 
the distance between M; and Mi is equal to r» and the point Мо is the 
origin of the frame of reference. 
~ Suppose that both charges are positive, i.e., 44 > 0 and q2 > 0; hence, 
they repel each other. By Coulomb's law the absolute value F of the force 
of the electrostatic interaction of two point charges in vacuum is 


F= k PP 
F 
where r is the distance between the charges and k is a constant. 
Then applying the above formula, we obtain 


г. Т 
2 


^ d 
W = | k VT. adr = каф | 4 = kae(- 1) 
(4 r r 


г fi 


1 1 
= --- — —— P" » 
кае (4 г) 


The mass and centre of gravity of а rod of varying density. Consider 
a rod of varying density. Think of the rod as a closed interval [a, Б] of 
the x-axis and suppose that the linear density of the rod is defined as a 
function р = р(х). 

We partition the closed interval [a, b] into n subintervals [xe - 1, Хк], 
К = 1, 2, ..., n, by choosing the points 


г, 








fy 


Xu«a«x«x«... SM < Xn =b. 
We let & denote an arbitrary point in the kth subinterval [хк-1, xx], 
k=1, 2, ..., n, and consider the sum 


У) o(&) Ах. 
k=1 


Clearly, each summand gives the approximate value of the mass of the 
part of the rod corresponding to the subinterval [xx - 1, хх]. Then this sum 
determines the approximate value of the total mass of the rod so that we 
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n 
can define the total mass т as the limit of 5) o(£x) Ах as imax Ax > 0 
k=1 &K&n 


b 
which is equal to the integral | e(x) dx. Thus, the total mass m of the 


rod of varying density g(x) is given by the formula 


b 
m- | о(х) dx. 
ü 
To define the centre of gravity of the nonhomogeneous rod we shall 
use the formula for the centre of mass of n particles Mi, M5, ..., M, 
with masses mı, т», ..., m, located at the points xi, X2, ..., x, on the 


x-axis. The centre of mass x. of this system of particles is given by the 
formula 


n 
> INKXk 
тх + (ax) +... + 4МлХ к-1 


my + И +... + тһ n 
2, тк 


k=1 





Let us partition the closed interval [a, b] into л subintervals [Хк-1, хк] 
by choosing the points а = xo < xi < ... < b = x, and compute the mass 
тк corresponding to the kth subinterval, А = 1, 2, ..., и. Clearly, 


Xy 
тк = | о(х) dx. Then applying the mean value theorem (Theorem 10.5), 


Xe 
we get 


тк = о(&к)(хк — хк-1) = e(£) Ахк, 
where хк € £x < Xx. 

Assume that a particle with mass тк is located at some point £ in 
the subinterval [хк-1, хк]. Then we can replace the rod by a system of 
n particles with masses тк located at the points £i, £, ..., £ in the closed 
interval [а, b]. Since 

n n x b 
У т= У | об)ах= | oxyde =m, 


к-1 Кел, bs 


the centre of gravity of the nonhomogeneous rod can be approximated as 


>) &o(Ex) Ax 
k=1 


m 


х = 


The nominator is the integral sum for the function x o(x) on the closed 
interval [а, b]; hence, the centre of gravity of the nonhomogeneous rod 


32 — 9505 
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is given by the formula 
b 


| х о(х) ах 
Е : 
| ео) dx 


a 


х = 


Example. Define the centre of gravity х, of the nonhomogeneous rod 
with linear density o = x and length / = 1. 
ча The mass of the rod is 


1 d 1 
m — хах = =. 
| ra=; 


The desired centre of gravity is 


: 2 
хо-2| x!'dx-£. > 
à 3 


10.8 Numerical Integration 


In dealing with physical problems we frequently work with definite 
integrals involving continuous functions that have no elementary an- 
tiderivatives. To compute these integrals we have to numerically approxi- 
mate them. Given below are two convenient approximations, namely, 


trapezoidal approximation and parabolic approximation. 
b 


Trapezoidal approximation. We wish to compute the integral | fo) ах 


where f(x) is a continuous function on the closed interval [g, Б]. For sim- 
plicity we set f(x) > 0. Consider the partition of [a, b] into n subintervals 
of equal length (the regular partition) by the points 


а= а< < 0 <... < Xn-1X€ Xs = D. 


Let us draw n vertical lines x = xx, k = 0, 1, ..., n and construct n trape- 
zoids as shown in Fig. 10.30. The total area of these trapezoids is approxi- 
mately equal to the area of the curvilinear figure aABb so that 

b 


| ix = 209109 Gy, y, LO) fen 


Qo — хм) 


a 


E f - 1) + Л) 








Spr oes 7 (Xn — Ха-1) 
n-1 

- 2а [ло + no) +2 2) fo]. 
k=1 
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where /f(x«-1) and f(x.) are the bases of the trapezoids and 
b-a 
Xk — Xk-1 = 





are their heights. 


Thus we arrive at the formula 


b п-1 
| fo) dx = 2 ын [ло (ру 22297 e| 


k=1 





called the trapezoidal approximation. Notice that the larger is n the higher 
precision of approximation is obtained. 











Fig. 10.30 


It is worth noting that given a function f(x) having the continuous sec- 
ond derivative f" (x) on [a, b] the absolute error of approximation does 
not exceed the number 

—d 
M 6-20 
12n 


kd 


where М = max |f" (х). 
axx&b 


1 

: 21225 а. 
Example. Compute the trapezoidal approximation to | Vr 
with n = 10. 0 

~ Consider the regular partition of (0, 1] by the points хо = 0, x, = 0.1, 
0 = 02, ..., ху» = 0.9, хо = 1. Let us compute the values of the function 


fe) s 


х +1 





at these points. We have 


32* 
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ЛО) = 1.0000, (0.1) = 0.9091, J(0.2) = 0.8333, 
(0.3) = 0.7692, — f(0.4) = 0.7143, f(0.5) = 0.6667, 
J(0.6) = 0.6250,  f(0.7 = 0.5882, /(0.8) = 0.5556, 
f(0.9) = 0.5263, Л) = 0.5000. 

Then ane trapezoidal approximation is 


dx _ 1 [1.0000 + 0.5000 
x+1 10 2 





+ 0.9091 + 0.8333 + 0.7692 


+0.7143 + 0.6667 + 0.6250 + 0.5882 + 0.5556 + 0.5263) 
= 0.69377 = 0.6938. 


Let us estimate the error of the computations. Since 


з 2 Hen yo Se n and f” (x) = wg 


On the closed interval [0, 1] we have |f"(x)| <2 and, consequently, 
М = max |f" (x)| = 2; hence the error does not exceed the number 
5х5 
МЕ ау) _ 2 1 


= = - « 0.0017. 
12n? 12 x 10? 600 





The Newton-Leibniz theorem gives the exact value of this integral as 
1 


1 
| ete dy elio 0.00115: 
ха 0 





The absolute error in the trapezoidal approximation is smaller than 
0.0007; hence both these results are in full agreement with estimates given 
by theory. » 

Parabolic approximation. We start with computing the area Q of a plane 
figure bounded by the arc AB of the parabola y = Ax? + Bx + C passing 
through the points Mo(0, yo), Mi(Ah/2, yı) and M2(h, y2) (Fig. 10.31). This 
area is given by 

h 











x h x h h 
Q= | A + Bx CO dx-A—| 48-51 + Сх 
3 221: А 
? 3 2 
- ADI. + BŽ + Ch = Ë QAM! + 3Bh + 6С). (+) 


Now we express the area О in terms of ordinates of Mi, M» and М». 
Substituting the coordinates of these points into the equation of the given 
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parabola, we obtain 


»- C n= 0 вй + С, у = Ah? + Bh + C. 
Whence 

2Ah? + 3Bh + 6C = уо + 4у + у 
so that 


Q = = (у + 4i + у»). 


ox 


у= Ax? + Вх «C 





Fig. 10.31 


b 
Let us consider the definite integral | f(x) ах where f(x) is a continuous | 


а 
nonnegative function on [a, b]. We divide [a, b] into 27 regular subintervals 
(notice that 2n is even) by the points 


a—Xo «Xi € X2 <... < Xan-2 < Xon-1 < Mn =D 


and write down the original integral in the form 


b х х, Xj 
| fo dx = | Дх) dx + | Дх) ах +... + | F(x) dx. (жж) 
а Xo Xx Хїя-2 

We let A, Mi, Mo, ..., Mon-2, Mon 1, B denote the points of intersec- 
tion of the vertical lines x = xx, k = 0, |, 2, ..., 2n, with the graph of 
the function y = f(x); let yo, Ул, Y2, ..., Y2n- 2» Уп - 1, Yan be the ordinates 
of the respective intersection points. If we draw a parabola with vertical 
axis of symmetry through every triple of points Мк 2, Moy 1, Мк (К = 1, 
2, ..., п) we get п curvilinear plane figures bounded above by the parabolas 


(Fig. 10.32). Since the area of the curvilinear plane figure corresponding 
Zf is approximately 





to the subinterval [хк-2, хк] with length Л = р 
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equal to the area of the respective “parabolic” trapezoid, formula (*) gives 
Xn 
b-a 
Јох) dx = бп (узк-2 + 4уэк-1 + Рк), 


Хэк-2 


where yk = f(x), k 21,2, ..., n. 
Then formula (ж») becomes 








b 

b- 

| se) ax = " [yo + Yon + 202 + ya +... + уэл-2) 
+ AQ + уз +... + ym-1). 


This is the parabolic approximation which is sometimes called Simp- 
son's approximation. 


Xon-2 *2n-1 Xan 8 





Fig. 10.32 


Notice that when f(x) has a continuous fourth derivative f(x) on 
[a, b] the error in the Simpson's approximation is at most 


(b — ay 
M 
2880n* ' 


M = max |/? (х). 
where max |f* (x)| 


The error in Simpson’s approximation decreases faster than that in the 
trapezoidal approximation as п increases; so the former is more precise than 


the latter. 1 
: ХЭТ : ах 
Example. Compute Simpson's approximation to the integral | X 

0 


with 2n = 4. 
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ча Consider the regular partition of the closed interval (0, 1] by the points 
Xo = 0, xı = 0.25, x; = 0.50, хз = 0.75, x4 = 1. The values of the function 


at these points are yo = 1.0000, y; = 0.8000, y; = 0.6662, 





ы 1 
p x+1 
ys = 0.5714, ys = 0.5000. Then Simpson’s approximation is 

1 


ax Б- 
| SOT ey Do tya 206 + 401 + Ээ = 





a [1.0000 + 0.5000 + 2 x 0.6662 + 2(0.8000 + 0.5714)] = 0.69325. 


Let us estimate the error of the result thus obtained. The integrand 











1 1f xil 4) 24 
x)= has the fourth derivativ = ———"__ so that 
fix) = 5 ive JOW = Tops 
4) 24 
M = max |f(x)| = max | ———, | = 24. 
0<х<1 0<х<1 | (1 + x) 
Hence the error is at most MK. NN « 0.0005. Comparing Simpson's 
2880 x 2 


approximation with the exact value of the integral we conclude that the 
absolute error in Simpson's approximation is less than 0.0001 as it has been 
predicted above. 

The computations of the trapezoidal approximation and Simpson’s ap- 





; : : dx : . 
proximation to the integral 1 prove that the latter is more precise 


than the former. № 9 


Exercises 
Compute the following integrals by applying the Newton-Leibniz 


theorem 
2e 


4 
1 т 
1. | х2Ухах. 2. | (ve - xj dx. 3. | Lun 4. | cos x dx. 





0 Vx ) х 172 
1/42 1 Р, А 3 
dx x xdx 
5 ——. 6 | 7 x|dx 8. | А 
"a N 1- x | | Ух +1 
= 0 


-3 


Lex 
е^ а 1 
9. | ELA 10. | = 11. = 
e 0 





425 + 3x 
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4 2 1 
" | In (x + V9 + x?) de | хах i | xdx 
V1+ x4 V4 — x4 


2 
У 9+ х $ 


1/2 


7/2 2 
yo. cos x — anx 
15. | 35655 sin 2x dx. 16. —————À—— dx. 
0 cosx + sinx 


e 
7/12 1 
17. | Insin2x:cot2x dx. 18. | UM e 19. | sinh? x dx. 
0 0 

1 
In2 1n3 


20. | tanh x dx. 21. | tanh? x dx. 
0 0 


Integrate by parts each of the following integrals 
2a 1 1 


22. | хяпхах. 23. | In (1 + x) dx. 24. | xsinh x dx 
0 0 0 


1/42 1 
25. | sin !xdx 26. | x? е"/? ах. 
0 0 


Compute the areas of the plane figures bounded by the given curves 
and the indicated axes 


27. y = x? + 2x — 3 (parabola), y = x + 3 (line). 

28. y = 2x — x? (parabola), y = —x (line). 

29. y = x? — 1 (parabola), x = 2 (line); the x- and y-axes. 

30. у = х? – 3x — 4, y = 4 + 3x — х? (parabolas). 

31. y=x-1, y=1, y=In~x. 

32, y=e 7, у= е*, х = 1. 

33. у = х?, у = 8; the y-axis. 

34. х = a cos! t and у = a sin? t, a > 0 (astroid). 

35. x = a (t — sint) and y = a(1 — cost), a > 0 (an arch of a cycloid); the 

axis of abscissas. 

36. x = a (2 cost — cos 2t) and y = a(2 sint — sin 2), a > 0 (cardioid). 

37. о =a sing, а > 0. 38. о =asin2y, a > 0. 39. о = 2 + sing. 
Compute the volume of the solid generated by revolving the given curves 

or the given plane figures around the indicated axis 


40. y = sinx, 0€ x € v; around the x-axis. 
41. y = sin? x, O € x € т; around the x-axis. 
2 
x Т : 
42. — + ar = ] (ellipse); around the x-axis. 
a 
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43. The plane figure bounded by the parabola у = ax — x?, a > 0, and 
the x-axis; around the x-axis. 
44. The plane figure bounded by the parabola y = x’, the line y = 1 and 
the y-axis; around the y-axis. 
45. Compute the volume of the solid bounded by the paraboloid 


у? 2? 

cp "pg ane the plane x = a. 

46. Compute the volume of the solid bounded by the hyperboloid of one 
2 2 2 


x | 
sheet Zy + mp = 1 and the planes z = 0 and z= h. 
a c 
Compute the length of the arc of the given curve between the indicated 


points 
2 


41.y- M from (0, 0) to (1, 1/2). 


48. y - Vx (semicubical parabola) from the origin of coordinates to 
A(1, 1). 

49. y = Inx; from x = УЗ to x = V8. 

50. y = Insinx; from x = 7/3 to x = 7/2. 


51. x = a cos?! t and y = a sin! t, a > 0 (astroid; define the total length). 
3 


52, х= $ — t and y= Ê + 2; from t = 0 to t = 3. 


3 
53. х = e cost and у = e' sint; from f= 0 to f= ат. 
54. o = ae*, a > 0, (logarithmic spiral); inside the circle o < а. 
55. о = a sine, a > 0; (total length). 


56. о = аф, a > 0, (spiral of Archimedes); the length of the first winding. 














Answers 
8 2 
ic-x*. 3mz4c-ibs56:67*. т. 65.8. ^5.9 —^.10.2. 11.1. 
17 3 2 4 3 3 
35 1 т 2 In 2 : 23 
12. — In? 3. 13. — In (4 + N17 ). 14. —. 15. — —. 16. 0. 17. -——. 18. sin 1. 19. sinh? 1. 
2 2 4 1n 3 4 
4-52 
20. -а2. 21. In 3 - Â, 22. —2r. 23. 1n4 — 1. 24. e^. 25, TŻ > . 26. 2 — Ve. 
3 
27. А 28. 4.5. 29. 2. 30. a 31. e — 2.5. 32. 2 cosh 1 — 2. 33. 12. 34. A ла”. 35. 3ra’. 
2 2 2 5 
2 та та 9 т 3 , 4 2 та T 
. —. 38. . 39. . 40. . 41. .42. b’. 43. . 44. —. 
36. 6та?. 37. ——. 38. —2 39. єт. 40 2 4. ет 5 "0 30 5 
h? 1 13413 – 8 
45. та'/рд. 46. «n (1 x) 47. у? + In(1 + %)]. 48. E е 


49. 1 ezinl. 50. —In3. 51. ба. 52. 12. 53. (x – 1)/2. 54. av2. 55. та. 


юе 


56. та\/1 + 4r? +518 (ох + М1 + 412 ). 


Chapter 11 


Improper Integrals 


11.1 Integrals with Infinite Limits of Integration 


The notion of the definite integral involves a function which is de- 
fined on a closed (bounded) interval [a, Б] so that the interval of integration 
is always bounded. Certain applications of integral calculus lead to integrals 
of functions defined over unbounded domains; these are infinite semi- 
intervals of forms [a, +œ), and (— «o, Р] or the infinite interval of the 
form (— оо, + хо). For example, we encounter such integrals when comput- 
ing the potential of gravitational or electrostatic forces. 

To make the notion of the definite integral applicable to unbounded 
intervals of integration we must rely upón some new concepts and defini- 
tions which clarify the meanings of the symbols 


+0 b To 


| /одах, | Ло)ах, | fix)dx. 


Let a function f(x) be defined for all x > a and integrable, say continu- 
ous, on every finite closed interval a < x < b where a is a given number 


and b (b > a) is any arbitrary number. To deduce what we mean by writing 
+ о 


| Ло) ах (*) 
a 
Korn is called the improper integral, we consider the function J(b) = 
= | f(x) dx of the variable b (b > a). 
a 


Definition. If the function J(b) has a finite limit L as b > + со the im- 
proper integral (ж) is said to converge. 
In this case we write 


T дә dx — „іт | Ах) ах = L. 


Suppose that the function ДР) has no (finite) limit as b > + оо. Then 
the improper integral (*) is said to diverge; in this case no numerical value 
is assigned to this integral. 
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m 





Examples. (1) The improper integral | converges and is equal 








2 
x 
т 
to 2 5 
*4 Indeed, by the definition we have 
+œ b 
dx ? dx К = T 
= lim = lim tan '‘b==. » 
| 1 + x’ Jim. | 1+x b- ев 2 
+0 


(2) The improper integral | cos x dx diverges. 


0 
b 


4 Indeed, the integral | cos x dx = sin b has no limit as b > + œ; hence, 
it diverges. 9» й 
(3) Let two charges qı and 4 be both positive so that they repel each 
other. By the Coulomb's law the absolute value F of the force of electrostat- 
ic interaction of two point charges in vacuum is given by 
k 
Е = 9192 


г? o’ 





where r is the distance between the charges and K is a constant. 

Suppose that the charge qı is located at the point Mo which is chosen 
as the origin of some reference frame and the charge qz is located at the 
point M. We let r, denote the distance between the ponts M and Mo and 
compute the work done in moving the charge 4 from M to infinity. 
ча The desired work W is defined as the improper integral 


+ © To 
k а 
w- | E? а-аа | Eg 
r r 
n n 


By the definition we have 





е lim М.а 2 
гал b> +% \ Ti b n 


k : 
. Let Ф be a unit charge. Then W = 28, this 
1 


Therefore, W = 22 
1 


quantity is called the potential of the field induced by the charge qi. 9 
+ о 


(4) Consider the integral | ах а = const. 
x 


1 
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We wish to determine the values of a for which the integral converges 
and those for which it diverges. 
ча By the definition of the improper integral we have 


+% 


b 
| dx . | dx 
. = lim a 
x b> +œ x 


1 1 


Let a » 1. Then 








b bi-« 1 





b 
| dx _ xe 


Хо 1-а 





| l-a 1-а 
1 


so that for а > 1 we have 
b 
: dx : bs 1 I 
lim — = lim - - : 
bo +00 x” b> + l-a 1-а а – 1 
1 


Thus the original integral converges if о > 1. For a < 1 the integral | 





b 





1 
has no finite limit as b > + oo; in other words the integral diverges if a < 1. 
Now let a = 1. Then 


b 


b 
| ES = | dX nb 
x x 


1 1 





Whence it follows that for a = 1 | Ld > +æ as b> 400, 
Therefore we infer that i ii 


+o 


the integral | 
1 





dx (converges if а > 1, 
х" (diverges if o « lI. 


This result can easily be interpreted geometrically. Consider the region 
D bounded from the left by the line x = 1, below by the x-axis and above 
by the graph of the function y = 1/x? (Fig. 11.1). This region is unbounded 
from the right, i.e., it extends to infinity. By convention we regard the limit 
of the region bounded from the right by the line x = b provided that 
b — + оо is the area of the infinite region D (Fig. 11.2). It is easy to see 
that for а > 1 the area of the region D bounded above by the graph of 
y = 1/x* is finite; if D is bounded above by the graph of the hyperbola 
y = Vx or of the curve y = 1/x* with a < 1 the notion of the area of D 
has no meaning. 
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RY 


Fig. 11.1 








Fig. 11.2 


+ % 
ах 


a 


Remark. Notice that given any a > ô > 0 the integral 





converges 

x 
for а > 1 and diverges for a < 1. а 
The definition of the improper integral 


To b 


| Л) ах= lim | Л) ах 
b> +œ 


a a 


implies directly the following 
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+ m 
(1) If the integral | f(x) dx converges so does the integral | (x) dx 
a a 


where А is any real number and 


ү M(x) dx = Г Ах) dx. 


(2) If both T Ах) ах and 15 ф(х) ах converge so does 
T (f(x) + ф(х)) dx and 
| Оо) + e@)dx= | Ло) ах + | p) ах 


“4 Indeed, given any b > a there holds 


b b b 

| Ue) + e&) dx = | ло)ах+ | б) ах 

Both integrals on the right have limits as b > + œ; hence, the integral on 
To 

the left also has a limit as b > + о, ie, | (f(x) + ф(х)) ах converges. 


a 
Evaluating the limit of the above identity as b > + со, we obtain the desired 
result. » 


+ 
Problem. Let the integral | (f(x) + e(x)) dx converge. Determine 


+ о +o 
whether or not the improper integrals | f(x) dx and | v(x) dx converge. 
a a 


(3) Let u(x) and v(x) be continuously differentiable functions on the 
line x >а. Then 


+ + to 
| и dv = (u(x) v(x)) - | v du (integration by parts) 
a a a 


provided that at least two of the three terms involved make sense (notice 
that in this case the third term also makes sense). 
+ < 
Example. Integrate by parts J; = | X"e * dx, where n is a natural 
0 


number or zero. 
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4 We have 
+ + < + 
Jn = | x"e dx = —(х"е *) +n | х”7!е7*ах=пА-\, 
0 0 0 
n--l,2,.:. 


Notice that 


+e 
| е-*йх=1 


a 


Jo 
we get =n! В» 


11.2 Integrals of Nonnegative Functions 


In a variety of applications it suffices to determine whether a given 
integral converges or not without having to evaluate it directly. Convergence 
tests are convenient devices in many instances. Here we formulate some 
valuable convergence tests for improper integrals in the forms of theorems. 

Theorem 11.1. Let f(x) апа ф(х) be integrable functions on [a, b] for 
any b >a and let 0 < f(x) < ф(х) for all x > a. Then 
+a +o 
(1) If | e(x) dx converges so does | f(x) dx. 
a a 
To To 
(2) If | f(x) dx diverges so does | ф(х) ах. 
а а 
: + 
"4 First we suppose that the integral | p(x) dx converges and prove that 


a 
+0 


so does | Лох) dx. In other words we have to show that the function 
a 


b 
J(b) = | Лх) ах has a finite limit as b > + œ. 


a 
Since f(x) is a nonnegative function for all x > а J(b)must be a non- 
decreasing function in b. Indeed, if bj > b then 


b, b b, b 
J(bi) = | Јо) dx = | f(x) dx + | Јо) dx > | f(x) dx = ЛБ). 
a a b a 


Also, since f(x) < ф(х) for all x > a then given any b > a there holds 


b b 
| feddx < | e) dx. 


a a 
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b 
The value of the integral | ф(х) dx is not larger than that of the in- 


a 
+ а 


tegral | ф(х) dx; the latter integral converges by the hypothesis. Hence, 


a 


given any b > a there holds 


b +e 
л = | Лак | ебдах=1. 


a a 
b 
so that J(b) = | Лх) ах is a nondecreasing function in b which is 


bounded above (as b > + ә). This means that J(b) has a finite limit as 
To 


b> + оо and by virtue of the definition the integral | f(x) dx converges. 
a 
Now we prove the second part of the theorem. Let the integral 


To 
| Лх) dx diverge. Assume the converse, i.e., assume that the integral 


ü 
+ с 
| ф(х) dx converges. Then the first'part of the theorem tells that the in- 


ü + © 
tegral | f(x) dx must converge and that contradicts to the hypothesis 


a 
of the theorem. Hence, our assumption is false and the integral 
+a 
| ф(х) ах diverges. 9» 
a to 
= ж 


. . : e 
Example. Investigate the improper integral T NEP EE ries 
] t x^ sin х 


-xr 


e 


———3,——4- is such 
1+x° + sin’ x 


“4 Notice that for all x > 0 the function f(x) = 


that 


-r 


e 
Ixxl-six  1+x 





0< y = (х). 
+o 


The integral | TE. converges (See Example (1) on p. 507). Then 
tx 


0 
by virtue of part (1) of Theorem 11.1 the integral in question also converges. 
It is worth mentioning here that we fail to investigate this integral by apply 
ing the definition of convergent integrals. 9 
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Theorem 11.2. Let f(x) and p(x) be continuous nonnegative functions 
for all x 2 a and let ф(х) be distinct from zero for all sufficiently large 
+a +o 


x. Then both integrals | f(x) dx and | ф(х) dx either converge or diverge 
a a 


provided that there exists 


x2 + p(x) 


=k #0. 


ч Indeed, let lim Fe) 
Xo +0 ф(х) 


means that given any number ғ > 0, say € = : » 0, there exists a number 
N such that for all x 2 N 


— k » 0. By the definition of the limit this 








Jo») k 
——-k| <e=2 
e(x) 2 
or, equivalently, 
k Јо) 23 


Recall that (х) > 0; then the above inequality becomes 


к (x) < Дх) < 2 Ко) vx2N. 


Theorem 11.1 implies that 
+ о + < 

(a) If | ф(х) dx converges so does | Рх) dx since f(x) < 3 ke(x), 
N N 
To to k 

(b) If | ф(х) dx diverges so does | f(x) dx since 7 g(x) < f(x). 


N N 
+œ 


Proceeding in a similar way we infer that if | То) dx converges 
N 


+ о 
(diverges) then | ф(х) dx also converges (diverges). 
N 


+0 To 
These results remain valid for | f(x)dx and | ф(х) ах due to the fact 


a a 
+0 


that the integral | 8(х) dx converges or diverges simultaneously with the 
a 
Teo 
integral | g(x) dx, where p > a is any whatever large fixed number since 


p 
the difference of these integrals is also an improper integral. Р» 
33—9505 
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Example. Determine whether or not the improper integral 


To 


2x! + 1 
х + 3х +4 
1 
converge. 
: 217351). T : 
< The integrand f(x) = ————— —- is positive for all x > 1. Write 
X +3x+4 


down f(x) as 


24 vx 


200556 rau 





Whence we conclude that for large x the function f(x) is similar to the 
function 2/x. 
To + о 
1 : : ах | 
Put ф(х) = x and consider the integral Hove ф(х) dx. This 
1 1 
integral diverges and by virtue of Theorem 11.2 so does the original integral 
since 
2 
JU) _ um E AE ү » 
x c p(x) хэ + X + 3х + 4 + 


: : : ах 
Theorem 11.1 along with the results concerning the integral | E 





+o 1 


lead to the following convergence test for the integral | f(x) dx involving 


a nonnegative function f(x). г 


Theorem 11.3. Let there exist а number а > 1 such that for all suffi- 
ciently large x 


0 < w «M, 
X + 


where M is independent of х and М > 0. Then the integral | Ро) dx con- 


verges. Е 


If f(x) 2 M where M is independent of x and M » 0, whenever x is 
+o 


sufficiently large, then the integral | f(x) dx diverges. 





Me > 1) for all x 2 А > max (a, 0]. Since the in- 


4 Let 0 < fe) < — 
To C x 


: M. 
tegral | Ёо dx where a > 1 converges then putting ф(х) = > in The- 
e : 
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+ «© 
orem 11.1, we conclude that so does | Ло) ах. Whence it follows that 
+ А 
| Лх) dx converges, for 


a 
b 


| fo) dx = Ї Ло) ах + 


а а 


b b 
and the integrals | f(x) dx and | 

а A 
limits as b > + œ. 


f(x) dx 


RT 


~ 


(x) dx must simultaneously have finite 


Now let f(x) > M (M » 0) for all x 2 A » maxía, 0]. Since tbe in- 
+ c 


tegral | M, dx diverges then, by virtue of Theorem 11.1, so does the in- 





+ + = 
tegral | Ах) dx; this implies that the integral | /(х) ах also diverges. > 
A 
+ о : 
Example. Investigate the integral 2582 dx. 
р E E Lx HIS: 7 
< For x z 3 we have ? 
ox x—2 2 x _ 1 
xt x? + 2x 45 x xt 
+o 


The integral | а converges since а = 2 > 1; hence so does the 
x 


original integral. » 


11.3 Absolutely Convergent Improper Integrals 


Let a function f(x) be defined for all x > a and integrable on every 
[a, b] where D> a. 
+ < 
Definition. The integral | f(x) ах is said to converge absolutely if the 


+ о E 


integral | |f(x)| dx converges. 
a +œ +0 
If the integral | f(x) ах converges and the integral | ГОО) dx diverges 


a a 
to 


we say that the integral | f(x) ах is conditionally convergent. 
a 


33* 
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+œ 
Theorem 11.4. If the integral | | f(x)| ах converges so does the integral 
T a 
| f(x) ах. 


а 
+0 


<4 Let | (/00| dx converge, i.e., 


a 


b 
lim | |(х)| ах = 1, < + о. 
Бә + 2 


Since given апу х іп the domain of f(x) there holds 


- Ро) < Дх) < |f?) 
then 
0 « |f) + SO) < 2170). (9) 


+o 
The integral | |/œ)| dx converges by the hypothesis; hence, so does 


a 


To to 
the integral | 2|f(x)| dx - 2 | [/G) dx. Then by virtue of (*) Theo- 


+o 


rem 11.1 yields that the integral | U(x) + [/(x)]) dx converges. This means 
b a 
that the integral | СОО + |700) ах has a finite limit as b > + co. 


Evidently, we have for all x 2 a 
Јо) = (о) + [Лор — IFO. 
Whence for any b > a there holds 


b b b 
| лах = | Оо) + [60D dx — | [feo] dx. 


a a a 


Both integrals on the right have finite limits as b > + œ. Hence, the integral 
b To 


| f(x)dx also has a finite limit as b > + œ, ie., the integral | f(x)dx 
a a 
converges. 

+otheorem 11.1 and the results concerning the integrals of the form 


+ © 


| E leads to the following convergence test for the integral | Дх) ах. 
х 


а 
! Theorem 11.5. Let there exist a number а > 1 such that for all suffi- 
ciently large x there holds 


бә] « -M., 
X 
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+ о 
where M is independent of x and М > 0. Then the integral | Jf) dx con- 


verges absolutely. М 


< Suppose that the above condition holds for all x 2 А > max fa, 0}. 


+o 


Since the integral | —, dx where a > 1 converges, then by virtue of The- 
x 


A + о 
orem 11.1 so does the integral | 1700) dx. This implies that the integral 
A 
+ +o 


| 700) dx converges and, consequently, the integral | f(x) dx converges 
a a 
absolutely. 


If œ| > 


+o 
M : 
—, the integral | Дх) dx does not converge 
x 


a 


absolutely В» da 





sinx 
x 


;— dx converges absolutely since 


For example, the integral | - 
1 


+ о 


sin x 


x 





. dx 
< 2 for all x, x > 1, and the integral | —5- converges. 
x x 








1 
+o 


Thus we infer that the integral | f(x) dx converges if it converges abso- 
a 
lutely. The converse is not true: not every convergent integral converges ab- 
+ о 














sinx : 
solutely. For example, the integral | dx being convergent does not 
converge absolutely. 1 
<4 Indeed, applying integration by parts, we obtain 
+ о +0 + со 
E +o 
| six y | Тараг _ совх | шээг dx 
x x x 1 x 
1 1 1 
+o 
cos x 
= cosl — —,— dx. 
| 5 
+0 1 


. COS X s 
The integral | 5 — dx converges absolutely; hence, it is convergent. 
x 





1 
Thus both expressions on the right of the above identity are finite so that 
(1) the method of integration by parts is justified and (2) the integral 
To 


sinx 





on the left is finite, i.e., the integral | dx converges. 


1 
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+ о 


sinx 





Now we show that the integral | dx is conditionally conver- 


1 + < 
{sin x| 





gent. In other words we show that the integral | dx diverges. 


1 
Notice that for any b > 1 the inequality 


: 1- 2x 
[sin x| > sin? x = — у 
implies that 
b | | b j b 
sinx 1 x 1 cos 2x 
Meira Зэс бае ENDE 
| x gx 2 | х 2 | x de e 
1 1 
*o b 
Е dx TENE 
The integral | E diverges, for lim | = К^ Applying in- 
b> + о 


dx, we infer that it converges. 





: : cos 2x 
tegration by parts to the integral x 
Evaluating the limit of (+) as b > +œ, we conclude that the right-hand 
side of (*) and, hence, the left-hand side of (*) diverge to infinity. This 

+o 


7 sin x : : 
means that the integral | deine]: dx diverges, whence it follows that the 
x 


1 
+a 


ds : sinx : 
original integra] | E E dx is not absolutely convergent. ® 


1 
Now we turn our attention to Dirichlet's convergence test which speci- 
fies the sufficient conditions for the improper integral to converge. 
Theorem 11.6 (Dirichlet's convergence test). 7f a function is continuous 
and has a bounded antiderivative F(x) for x 2 a.and if a function g(x) 
is continuously differentiable and monotonically decreases for x > a and 


if lim g(x) = 0, then the integral | JO) g(x) dx converges. 
x> to 


a 


The proof of this theorem can easily be found elsewhere in the relevant 
literature. 
By way of illustration we apply Dirichlet’s convergence test to the in- 
+ о 


sin x 
tegral | = dx, a> 0. 
x 


1 
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ча The function f(x) = sin x has the bounded antiderivative F(x) = —cos x 
for all x; g(x) = I/x* (a > 0) is a continuously differentiable function for 
x > land g(x) monotonically decreases and tends to zero as x > со. Hence, 
all the conditions of Theorem 11.6 are satisfied so that the integral in ques- 


tion converges. В» 
+o 


Problem. Show that the Fresnel integral | sin x? dx converges. (Hint. 


0 
Use the substitution х? = t.) 


11.4 Cauchy Principal Value of the Improper Integral 
Е 
We define the improper integral | f(x) dx by writing 


| Ах) dx = мн | f(x) dx 


b 
and say that | f(x) ах converges if this limit exists; otherwise the improper 
integral is said to diverge. 

If both the lower and upper limits of integration are infinite we set 


+ b 
| FQ) dx = um. | Ло) dx 
Or 
+ = М, 
| de= lim [feds 
29 NIC -N 


where № and № diverge to + oo independently. It may happen that the 
improper integral thus defined has no meaning while there exists its Cauchy 
principal value (С.рм.) given by the formula 


+ N 
С.рм. | fo dx = „іт | f(x) dx, 
-w *9 DN 


+o 
ie, when N; = N; = N. Then we say that the improper integral | Дх) ах 


converges in the sense of the Cauchy principal value. 
+a 
x dx 


Example. Investigate the integral | т 
+x 
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М. 
2 
44 We have хо Erin шы М; , whence it follows that the in- 
1+х 2 1-М, 
Ñ, 





N, 
xdx 125 
tegral re has no limit as № > +оо and № > +œ so that the 
x 
-M 
+ c 


integral | хех, diverges. On the other hand we have 
+x 





-œ 


+ с М 


ee хіх am хіх 14, 1+N EP 
TB que. aen V ue Оу TU 
2м 





Hence, the integral in question converges in the sense of the Cauchy prin- 
cipal value. » 


11.5 Improper Integrals of Unbounded Functions 


The necessary condition for the definite (Riemann) integral 
b 


| f(x)dx to exist is that the function f(x) be bounded on [a P]. 


a 

For example, if f(x) is integrable on [a, bi] where b, < b and is unbounded 
іп a neighbourhood of x = b then the definite (Riemann) integral of f(x) 
does not exist on [а, b]. Thus we need some concepts and definitions to 
generalize the notion of the definite integral to that involving an unbounded 

integrand. 
Let f(x) be integrable on [a, b — є] for any whatever small ғ > 0 and 
be unbounded in (5 — є, Б) (Fig. 11.3). To deduce what we mean by writing 

b 


| Ло) dx 


a 


we consider the function in € (e > 0) 


е) = T уо) dx. 


a 


Definition. If the function J(e) has a finite limit L as = > 0 + 0 we 
b 


say that the improper integral | Ах) dx converges and set 
a 


b =E 
| F(x) dx = jim, T Дх) dx = І. 


a a 
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If the function J(e) has no limit as e > 0 + 0 the improper integral 
b 
| f(x) dx is said to diverge; in this case no numerical value is assigned 


a 


to this integral. 


1 
Example. Investigate the integral | Би Рон 
J VI-x 


1 
ча The integrand f(x) = ——— is a continuous function on an 
8 I zx y 


10, 1— e] where € > 0. Hence it is integrable. On the other hand 











y 
і 
у= (х) 
| 
+ T ————— = 
0 а 0-5 b T 
Fig. 11.3 
f(x) ^ +æ as x 1 — 0. Then we have 
las 
. dx 2 . --1 — einai 7 
lim. | Лу = um. (sin (1—6e)-sin 1l- 2? 


0 
so that the improper integral in question converges. Notice that the substi- 


7,2 
: : с : : cost 
tution x = sin reduces the original improper integral to rs 


1,2 [U 
whence we obtain the definite (Riemann) integral | dt — 5 by putting 
0 


the integrand equal to 1 at t = 5 
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Analogously, if a function f(x) is unbounded only in (a, a + є) where 
b 
є > 0 we define the improper integral | Ах) dx ав 


a 


b b 
| Л) ах = lim, | Ло) dx. 


This integral is called convergent if the finite limit on the right exists; other- 
wise it is divergent. 
1 




















Example. Investigate the integral | x : 
0 
ча By the definition we have 
1 1 
| ах . | dx 
— = lim : 
xe £2040 x? 
0 Е 
Since for а z 1 there holds 
1 
ах xc 1523 gpl-9 
x? l-a |, l-a 1-а 
Е 
еп 
1 
? | dx 1 
lim m—— o o 
£040 x? l-a 


provided that a.< 1. 
1 


If a > 1 the integral | -ZX was no finite limit as €> 0 + 0. 
x 


€ 


Let a = I. Then 





1 1 
dx dx 
a = — = -Iné> +œ ase>0+0. 
x x 

Е € 


Thus the integral 


1 
dx (converges if a< l, 
x“ (diverges if a2. 
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If the function f(x) on [a, b] is unbounded only in some neighbourhood 
of the point c, where a « c « b,then we set 


b c b 
| 70) dx - | fe) dx + | feo dx 


c-é, ? 
= EN | Fx) dx + ү feo dx} 


We leave it to the reader to examine other discontinuities of f(x). 


11.6 Improper Integrals of Unbounded Nonnegative 
Functions. Convergence Tests 


Theorem 11.7. Let f(x) and (x) be integrable functions on 

[a b — є] for any whatever small positive € and be unbounded in 

(b — є, b). Let 0 < f(x) < (x) on [a, b). Then 
b 


(1) if the integral | e (x) dx converges so does the integral | f(x) dx; 


a 


b 
(2) if the integral | f(x)dx diverges so does the integral 


a 


ф(х) dx. 


Reo aco 


b 
“Let the integral | еб) dx converge so that there exists 


a 


im, | ф(х) ах = L. 
b 


We show that the integral | Лох) dx converges, i.e., that the function 
b-e a 
J(e) = | f(x) dx has a finite limit as € ^ 0 + 0. Indeed, since f(x) > 0 


a 
on [a, Б) then given any ғ > 0 (ғ < Б — a) the function J(e) in nonnegative 
and nondecreasing as є decreases. Also, if f(x) < ф(х) for all x є [a, b) then 
given any € > 0 there holds 


bre b-t 
| Ах) dx < | e(x) dx. 
a a 
b- 
The value of the integral | i ф(х) dx does not exceed that of the conver- 
b a 
gent integral | ф(х) dx; hence, given апу є > 0 we have 
a 


= b 
J(e) = í Дх) dx < | ф(х) dx = L. 


a a 
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Therefore the function J(e) is nondecreasing as є > 0 + 0 and is bounded 
above. This means that the finite limit of J(e) exists as € ^ 0 + 0 and, by 
b 


virtue of the definition, the improper integral | f(x) dx converges. 
a 
The second part of this theorem can easily be proved by contra- 
diction. » 
Theorem 11.8. Let f(x) and p(x) be positive functions on [a, b) and 
let f(x) and e(x) be unbounded only in some neighbourhood of the point 
x = b. Let there exist 


Јо) = к> 0 
x2b-0 ф(х) ` 
b b 
Then both integrals | Тод) dx and | ф(х) dx either converge or diverge. 


We now turn to the absolutely convergent improper integrals involving 


unbounded functions. 
b 


Definition. The integral | J(x) dx is said to converge absolutely if the 
b a 
integral | 1/00| dx converges. 
a 
As in the case of the improper integrals with infinite limits of integration 


the following theorem is true. 
b 


Theorem 11.9. 7f the integral | |f(x)| dx converges so does the integral 
b a 
| f(x) ах. 
а 
Applying this theorem we can easily prove the following convergence 
test for the improper integrals involving unbounded functions. 
Theorem 11.10. Let f(x) be a functon unbounded only in (b — є, Б) 
where € > 0 is any whatever small number. Let there exist a positive number 
a < 1 such that for all x sufficiently close to b, x < b, there holds 


M 
X) S ————á» 
100 € c s | 
where M is independent of x апа М > 0. Then the integral | f(x) dx con- 
verges absolutely. # 


Problem. Show that if given any x sufficiently close to b, x < b, there 
holds 


fe) > 





M 
b М> 0, 


X , 


then the integral f(x)dx does not converge absolutely. 


A eem | 
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Remark. The substitution b — x = 1/t or x а = 1/t reduces the im- 
proper integrals of unbounded functions to those with infinite limits of 
integration; so the major results concerning the former integrals can be 
deduced from the theory of the latter. 


11.7 Cauchy Principal Value of the Improper 
Integral Involving Unbounded Functions 


Let f(x) be an integrable function on the closed intervals [a, c — e] 
and [c + £, b] where e£ > 0, a < c < b and let f(x) be unbounded in some 
neighbourhood of the point c. As before, we can write 


b с b 
| fœ) dx = | fe) dx + | fe) dx 


c-6, b 
= lim { | Ах) dx + | f(x) ax}. 
£j 2040 2 нэ 
Є2-00-0 
Notice that the limit on the right must exist as є; and & tend inde- 
pendently to zero. 
Sometimes the improper integral thus defined has no meaning while 
there exists its Cauchy principal value given by the formula 


сЕ b 
C.py. Ї Ах) dx = Jim, Í | Дх) dx + | То) a ; 


a Ctt 


where е > 0 is the same in both integrals on the right. In this case we say 
that the improper integral converges in the sense of its Cauchy principal 
value. 


where c € (a, b). 





b 
Example. Investigate the integral f(x) — | 
a 











х= с 
за We have 
5 b Е 
"UC ах ах AUS 
+ —— = (ln |x - с) 
x-c x-c à 
a с+є, 
b 
b-c E 
+ (In |x — c]) = Іп +In—., 
C£, c-a £ 





The limit of the right-hand side as both ғ and € in an arbitrary manner 
tend to zero does not exist. Put ғ; = £2 = £. Then as є ^ 0 + 0 the limit 
of the right-hand side exists and is equal to the Cauchy principal value 


526 11. Improper Integrals 





of the original integral so that 
b 





C.p. | ах = In 2. a«c«b. т 
х- с с-а 

а 
Problem. Let a function f(x) be defined in the neighbourhood 
(—R, R) of the point x = 0 except probably at х = 0 and let f(x) be un- 
bounded as x > 0. It is known that any function f(x) can be represented 
in a neighbourhood of x = 0 by the sum of even and odd functions as 


fi) = OAD , ЛӘ ФЛ) py + б 


R R 
Show that C.p.. | f(x) dx exists if so does the integral | a(x) dx 
-R 0 


+ 2-2 
where № (х) is the even function MERE EE 
Exercises 


Using the definition of the improper integral determine whether or 
not the following integrals converge 





to + о + = 
хах х +1 155 dx 
1. M EE dx. 3. d 4. po 
| fer 2 | 3 3 | xe “ах | s 
1 2 
Determine whether or not the following integrals converge 
to d Too + © 2 
хах xli х dx 
5. —————-——. 6. ~ ах, 17. ————À———. 
| x-x4l | у +х+5 | x'-x4xl 
2 
teo d — = 1 - ео 
х x tan” x - dx 
8. | —L————. 9, | a dx. 10. | — — (a is a real 
Mx ЗГАА хіп“ x 
Y x +1 : x' +1 у 


* 


a 


+? In (1 + 2) 
number). 11. | —— dx. 


1 
+ 


12. Evaluate the improper integral | x?^ tle? dx where n is a positive 
0 
natural number. 
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13. Show that 


+ o +o 
C.py. | sinxdx = 0 and C.p. | 5. = 0. 


Using the definition of the improper integral determine whether or not 
the following integrals converge 





2 1 
dx 1 
14. ————. 15. xInxdx. 16. 
| Vx — 1 | | 

0 


0 
dx el 
a 17. | P dx. 
x - 
1 el” r dx 
| EE dx. 19. | “үс 
0 


-1 





Investigate the following integrals 


1 1 1 


20. -= 21. | UE 22. | ax 
1 


= х“ : 3 (1 = xy А Кр _ | 
К To X 
ji | uM | n OX ax, a >20. 
ех — cosx m 
3 0 
+o " et Е 
25. =a dx (a and 8 are real numbers). 26. cos x dx. 
E x+ 
: 0 
Answer 


1. Converges; equal to 7/4. 2. Diverges. 3. Converges; equal to 1/2. 4. Diverges. 
5. Converges. 6. Diverges. 7. Converges. 8. Diverges. 9. Diverges. 10. Converges for a > 1. 


! 
11. Converges for a > 0. 12. ~~. 14. Converges; equal to 2. 15. Converges; equal to —1/4. 


16. Converges; equal to 3/2. 17. Converges; equal to —2/e. 18. Diverges. 19. Diverges. 

20. Converges. 21. Diverges. 22. Converges. 23. Diverges. 24. Solution. The integral in question 

is a union of the integral with infinite limits of integration and that involving unbo- 
-1 

unded functions. Indeed, its upper limit is infinite and its integrand f(x) — an is 
x 


not defined at the point x = 0 and is unbounded as x > 0 for sufficiently large а > 0. We 
divide the interval of integration into two parts so that the first subinterval would contain 
the point of discontinuity of f(x) at x = 0 and the second one would be appropriate to examine 
the behaviour of f(x) as x > + «©. Consider, for example, the semiclosed intervals (0, 1] and 
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+ = To 

tan^!x tan !x tan^!x 
| P dx = | x dx * | xs dx. 
0 1 


Consider the first integral on the right. Recall that tan ! x ~ x as x — 0; hence, 
tan !x . ||, 
)- EU TEE is similar to 


Дх 





near the point x = 0. This implies that for the integral 


xt! 


tan^! 
| S dx to converge it is necessary that a — 1 < 1 or œ < 2. The integrand 


0 
-1 


t В : ЫС 22228 
fo = шин of the second integral on the right behaves similarly to the function 2 





: Е т us eke 
asx > + х since tan” x > 2 as x > + о. For this integral to converge it is necessary that 


a > 1. Combining both conditions we infer that both integrals on the right converge only 
if 1 < a < 2; this is the condition for the original integral to converge. 25. Converges if 


а> -land f - a > 1. 26. Converges conditionally. Hint: apply the Dirichlet convergence 
test. 


Chapter 12 


Functions of Several Variables 


12.1 Basic Notions and Notation 


The notion of a function of one variable is somewhat inadequate to 
model relations and phenomena we come across in the surrounding world. 
Many real-world functions depend on two or more variables. For example, 
the volume of the rectangular parallelepiped with edges x, y and z is given 
by the formula V — xyz where x, y and z are all positive. In this case we 
have four quantities; the value of V is fully determined by the values of 
three other variables x, y and z. This is one of the numerous applications 
of the notion of a function of several variables; the major topic of this 
chapter is the calculus of functions of several variables. 

Let R” be the n-dimensional Euclidean space and let M’ (xi, xz, ..., 
хх) and M’(xi’, x’, ..., x) be two points in this space. We let 
o(M', M") denote the distance between М” and M" and define 


n 
e(M', M') - |X - xy. 
k=1 


If n = 1 this formula gives the distance o(M', M") = |xi' — xi | be- 
tween the points M’ (xj) and M” (x(') of a line; putting n = 2, we obtain 
the distance o(M', M") = NT — xi Y. + Gd! — xi Y. between the points 
М (xi, х3) and M" (xf, ху) in a plane. . 

Definition. Let Mo(x?, x2, ..., х0) € R” and let € be a positive real num- 
ber. The set of all M € А” such that o(M, Mo) < € is called the n- 
dimensional open ball with centre at Mo and radius є. 

If n = 2 we have (x — xo)? + (y + yo)” < &?. This expression defines а 
circular disk with centre at Mo(xo, yo) and radius є. Notice that the circle 
bounding this disk is not taken into consideration as shown in Fig. 12.1. 

For n = 3 we obtain a ball with radius € and centre at Мо(хо, Yo, 2) 
specified by the equation (x — xo)? + (y — yo)’ + (Z — 20) < &^. Again 
the bounding sphere is out of consideration (Fig. 12.2). 

We shall also consider a somewhat different neighbourhood of a point 
Mo, xX, ..., x8) that can be called a rectangular neighbourhood. We 
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Fig. 12.2 


es 
m 





Fig. 12.3 


12.1 Basic Notions and Notation 531 


mean here a set of all points M(x, X2, ..., Xn) іп R” such that 
xài-ag«x«x + Gi, & > 0, і = 1, 2, ..., n. 


Putting n = 1, we obtain the e-neighbourhood xo — € < x < xo + € of 
the point x well familiar to us from the calculus of functions of one varia- 
ble; for n — 2 we get the plane figure bounded by the rectangle with sides 
2Е: and 2e» (sides are excluded, Fig. 12.3) and for n = 3 we have the (open) 
solid bounded by the parallelepiped with centre at Mo(xo, yo, zo) and edges 
281, 2&2 and 2e; (Fig. 12.4). 











Fig. 12.4 


Definitions. Let E C R". The point M € E is called the interior point 
of the set E if there exists є > 0 such that E contains M along with its 
&€-neighbourhood. 

The set E is called the open set if E contains only interior points. For 
example, for n — 2 any circular disk is an open set. 

The point P, P « А", is called the boundary point of the set E C R” 
if any neighbourhood of P contains both points belonging to E and those 
not belonging to E. 

The set of all boundary points of E is called the boundary of E; we 
denote the boundary of E by дЕ. 

__ The union of the set E and its boundary дЕ forms the closed set 
E = EUGE. For example, the union of a circular disk and the circle bound- 
ing the disk is a closed set. 

The set E C А" is called connected if given any two points in E a con- 
tinuous curve (or a polygonal line) joining these points is fully contained 
only in Е (Fig. 12.5a). Otherwise the set is called unconnected (Fig. 12.55). 


34* 
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An open connected set is called a domain. A domain is said to be 
bounded if there exists a ball that contains this domain. 

Any domain that contains a given point Мо is called a neighbourhood 
of М (in distinction from the -neighbourhood of Mo). 

Notion of a function of several variables. Suppose that there is known 
a rule by which every point M(xi, X2, ..., Xa) of aset E in the n-dimensional 
Euclidean space A" is associated with a real number u. Then we say that 
a function of the point M or a function of n variables xi, x», ..., Xn is 
defined on E and write 


u = f(M) or u = f(a, x, ee eg Xn), MEE. 
The set E is called the domain of the function f. 


(a) The connected set (b) The unconnected set 
Fig. 12.5 


We shall mainly confine our analysis to the functions of two variables 
z = f(x, у); the major results can easily be generalized to functions of more 
than two variables. 

When a function is specified analytically by a single formula and its 
domain is not indicated in advance we shall regard the set of all points 
M(x, х», ..., X»), where the formula attains a finite real value, as the do- 
main of this function. For example, the domain of the function z = x + y is 
the xy-plane and the domain of the function z = V1 — x? — y? is the circle 
хус! 

Let a function z — f(x, y) be defined on a domain E in the xy-plane. 
Then every point (x, y) € E corresponds to the point (x, y, f(x, y)) in the 
3-dimensional space. The set of all points (x, y, f(x, y)) where (x, y) € E 
is called the graph of the function z — f(x, y). For example, the paraboloid 
of revolution is the graph of the function z = x? + у? (Fig. 12.6). 

To investigate the behaviour and visualize the shape of the function 
z — f(x, y) it is helpful to turn to /evel curves. The level curve is a set of 
points in the xy-plane where the value of the function is constant so that 
z=f(x, y)=c. It can be constructed by intersecting the surface z=/(x, .y) 
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with the plane z=c parallel to the xy-plane as the vertical projection 
of the line of intersection into the xy-plane. A collection of the level curves 
fio y) = Cm, m = 1, 2, ..., К, where Cm+1 — Ст = h = const, provides a 
useful information about the function. It is easily seen that the closer the 
level curves to each other the higher the rate of change of the function. 
For the function z = x? + y? its level curves are circles centred at the origin 
of coordinates (Fig. 12.7, where Л = 1). 








Fig. 12.6 Fig. 12.7 


In the case of functions of three variables we can turn to /evel surfaces. 
The level surface of the function u = f(x, y, z) is the set of points 
M(x, y, z) in space where the value of u = ДМ) is constant. For example, 
the level surfaces of the function u = x? + y? + z? are spheres centred at 
the origin of coordinates. 


12.2 Limits and Continuity 


Definition. Let /(М) be a function defined in some neighbourhood 
О of the point Мо(хо, yo) except probably at Mo. The number А is called 
the limit of f(M) at the point Мо(хо, yo) if given any number є > 0 there 
exists a number ô > 0 such that for all points M(x, y) €Q distinct from 
Mo(%, yo) there holds 


IM) -A|«e 
whenever 0 < о(М, М) < ô. 
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In symbols we write 


A= lim ХМ or A = lim f(x, y). 
Мм-м, хх 


Уу 


We assume here that the point M may tend to Mo in an arbitrary mode 
(along any arbitrary direction or subject to any arbitrary law) and all the 
limiting values of f(M) thus obtained must be equal to the number A. 
Examples. (1) The function f(x, y) = x^ + y? is defined on the xy-plane 
and /(0, 0) = 0. We show that the limit of this function at the point 
ООО, 0) is equal to zero. 
94 Consider any є > 0. Then the condition |f(x, y) — 0| < е becomes 
|x? + y? - 0| < £ or|x?  y?| < є. Noting that V x^ + y? = r(M, 0) where 
M(x, y) is a point with the coordinates (x, y), we may write |х? + y!| < е 
as о(М, 0) < € or о(М, 0) < Ve. Put ô = Ve. Then given апу М(х, у) 
such that o(M, 0) < à = Ve we have |х? + y? — 0| < e or |f(x, y) - 0| < € 
(Fig. 12.8). By the definition of the limit this means that the number A 
is the limit of the given function at the point O(0, 0). » 





Fig. 12.8 


(2) The function f(x, y) = v is defined everywhere except at the 
х y 


point O(0, 0). We investigate the behaviour of f(x, y) provided that 
(x, y) tends to zero along the lines y = kx, x # 0. 

ча The lines defined by the equation y = kx pass through the origin of 
coordinates. We have 


2х2К 


iig 5 


Лх, kx) = 
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Whence 


f(x, kx) o ur as х > 0 


+k 
so that for different values of k the limiting values of the function are 
different. This means that the given function has no limit at the point 
O(0, 0). » 


2 
(3) The function f(x, y) = P is defined everywhere on the xy- 
x y 


plane except at the origin O(0, 0) of coordinates. Again we investigate the 
behaviour of this function provided that (x, у) approaches zero along the 
lines y = kx, x # 0. 
4 We have 
kx? 
fi, Кх) = age” x 0 

so that f(x, kx) ^ 0 as x 0. Hence, f(x, kx) has a limit equal to zero 
for any line y = kx, i.e., for any line along which the point tends to the 
origin of coordinates. If we put y = x? then f(x, x?) = 1/2, x # 0. This 
means that the limit exists when a point tends to the origin by moving 
along the parabola y — x?. However this limit is equal to 1/2. Thus the 
given function has no limit at the point O(0, 0). » 

Theorem 12.1. Let f(M) and Ф(М) have limits at the point Mo. Then 
at Мо there exist the limits of the sum AM) + Ф(М), the difference 


АМ) — e(M), the product f(M)e(M) and the quotient o (provided 
that im. Ф(М) = 0) and 





lim (АМ) + e(M)) = lim ДМ) + lim (М), 
MM; MM; Мә М 

lim (M): Ф(М)) = lim ДМ). lim (М), 
M^ м, MM, M > Mo 


lim f(M) 


.. JM _ mom : 
adn. eM) — jim (М) Cum MD даг 





It is sometimes helpful to use the following definition of the limit which 
is equivalent to the preceding one. 

Definition. Let a function /(М) be defined in some deleted neighbour- 
hood Q of a point Mo, i.e., in some neighbourhood of Мо which does not 
contain Mo. The number 4 is called the /imit of the function AM) at Mo 
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if given any sequence of points {M,} that converges to Мо (М, € Q, 
Mn » Mo) the corresponding sequence | f(M;)] of the values of f(M) con- 
verges to A. 

Remark. The above notion of the limit implies that all variables simul- 
taneously tend to their limiting values, ie, (x, у) —^ (xo, yo). Sometimes 
we have to deal with limits obtained by tending variables to their limiting 
values in succession; there are salient features of functions of several varia- 
хээ? 

х + у? 
defined everywhere except at the point O(0, 0). Tending x to zero for 
y = const # 0, we have lim f(x, y) = -1. If we put now x = const # 0 and 

Үс 


bles. By way of illustration we consider the function f(x, y) = 


tend y to zero then lim f(x, y) = 1. Thus we have 
у-0 
lim [lim f(x, уу) # lim [lim f(x, у). 
у-0 x70 x70 узб 


This means that the limit of f(x, y) depends upon the order in which we 
tend the variables to their limiting values. 

Definition. Let a function /(М) be defined at a point Mo(xo, yo) and 
in some neighbourhood Q of Mo. The function f(M) is called continuous 
at Мо(хо, yo) if im ЛМ) = f(Mo) or, equivalently, if lim f(x, y) = 

^ Mi X> Xo 
y 7 yo 
Лоо, yo). We assume here that the point M(x, у) approaches Мо(хо, yo) 
in an arbitrary mode, being always contained in the domain of f(M). 

Using the “sô” definition of continuity of a function at a point we 
say that a function f(M) defined at the point Mo апа in some neighbour- 
hood Q of М is continuous at Mo if given any е > 0 there exists ô > 0 
such that for all points M € there holds |f(M) — f(Mo)| < = whenever 
e(M, Mo) < ó. 

We can also give slightly different definition of continuity of a function 
at a point Mo. We let Ax and Ay denote the increments of the independent 
variables x and y in moving from the point Mo(%, yo) to the point 
M(x, y). Let Az = f(xo + Ax, yo + Ay) — Ах, yo) be the increment іп z = 
J(x, y) correspondng to Ax and Ay. Then the expression 


lim f(x, y) = f(Xo, у) 
X> Xo 
У Уе 
becomes equivalent to 
lim Az - 0, 


4х-0 
Ay 20 
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which describes the condition for the function z = f(x, y) to be continuous 
at the point Мо(хо, yo). We assume here again that the increments Ax, Ay 
can approach zero in an arbitrary manner independently of each other. 

The preceding notion refers to the continuity of a function with respect 
to every and all variables; it implies that if a function f(x, y) is continuous 
at some point Mo(xo, yo) then f(x, y) is continuous at Mo(xo, yo) with respect 
to botn the variable x and the variable y. 

However the converse is not always true; if a function f(x, у) is continu- 
ous at some point Мо with respect to both x and y f(x, y) is not necessarily 
continuous at Mo. To illustrate what we are speaking about we consider 
the function 





22 2 xX + y* £0, 
x+y 

x, у) = 

Дх, y) 0, — 


Clearly, we have f(x, 0) = 0 for all x, so that lim f(x, 0) = 0 = 
x70 


J(0, 0). Hence, f(x, 0) is continuous with respect to x at x = 0. Similarly, 

the function /(0, у) is continuous with respect to y at y = 0 ѕіпсе f(0, у) = 0 

for all y so that lim /(0, y) = 0 = f(0, 0). However, the function f(x, у) 
yoo 


is not continuous at the point O(0, 0). Indeed, let y = x. Then 


2x? 
х-у-0 х-0Х-4Х 


This function fails to exhibit continuity since we have assumed that the 
variables x and y approach their limiting values in moving along the x- 
and y-axes only and have left aside infinitely many other directions from 
which (x, y) may approach the point O(0, 0). 

Theorem 12.2. If both f(M) and e(M) are continuous functions at the 
point Mo so are the sum (M) + o(M), the product f(M)e(M) and the 


difference f(M) — Ф(М) and so is the quotient JM) 


e(M) 
€ (Mo) » 0. 

If a function ДМ) is continuous at every point of the domain D, f(M) 
is called continuous on the domain D. The point where the function /(М) 
is not continuous is called the discontinuity of f(M). Discontinuities of 
f(x, y) can either be isolated or fill in some curves. For example, the func- 





provided that 


tion f(x, y) = P has the only discontinuity O(0, 0) while discon- 
x 


tinuities of the function f(x, y) = | z form the lines y = x and 
cu 


х2 


y= —Хх. 
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Theorem 12.3. If the function f(M) is continuous on the closed bounded 
domain D then f(M) is bounded on D and attains its absolute maximum 
and absolute minimum in D. 

This theorem is a generalization of the respective theorems for functions 
of one variable (see Sec. 7.3). 


12.3 Partial Derivatives and Differentials 


Let a function z = f(x, y) be defined on some domain D in the 
xy-plane and let (x, y) be some interior point of D. Consider the increment 
Ax in x such that (x + Ах, y) « D (Fig. 12.9). 


| 











vel 


Fig. 12.9 


The increment 
Axz = fx + Ax, y) - f(x, у) 


is called the partial increment іп 2 caused by the increment Ax in х. 


Ад 


We let Ах denote the ratio of the partial increment in 2 to the respec- 





tive increment in x; clearly, this ratio is a function of Ax. 
Axz 





Definition. If the ratio has a finite limit as Ax > 0 this limit is 


called the partial derivative of the function z = f(x, у) at the given point 
(x, y) with respect to the independent variable x. 

We shall denote the partial derivative of z = f(x, y) with respect to x 
by writing 


д2 ? А 
ux OF Хх, у) or 24, у). 
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Therefore we put 


Эс сүрэг 
Ox Ebr Ax 





or, equivalently, 





f(x + Ax, y) - Дх, у) | 


(x, y) = lim 
ТАХ, у) m. Ax 
Analogously, we have 


az im Z lim fœ, у + Ay) – Дх, у) | 
ду ду-о Ay Ay 0 Ay 








Let u = fGa, X2, ..., Xa) be a function of n variables. Then 





ðu lim SO Xy- Xk- 1Xk + AXk Xk + 1,X8). — ЛОХ1,Хэ, Xk — 1 Xk ss Xn) 
дхк дк, -о Ах, { 


Noting that A,z апа A,z are computed by regarding x and y, respective- 
ly, as constants, we can give the following definitions of partial derivatives. 

The partial derivative of a function z — f(x, y) with respect to the varia- 
ble x is an ordinary derivative with respect to x computed by regarding 
y as a constant; similarly, the partial derivative of z — f(x, y) with respect 
to y is an ordinary derivative with respect to y computed by regarding x 
as a constant. Whence it follows that ordinary and partial derivatives are 
subject to the same laws of differentiation. 

Example. Compute the partial derivatives of the function z = e”. 
ча We have 


Oz 


nu cO and =: > 


ду 

Remark. If a function z = f(x, y) has partial derivatives with respect 
to every variable at some point, z — f(x, y) is not necessarily continuous 
at this point. For example, the function 


Lo p X +y #0, 
х? + у 
Лх, у) = 0 x=y=0 


is not continuous at the point O(0, 0) but has both a partial derivative 
with respect to x and that with respect to y since f(x, 0) = 0 and /(0, y) = 0 


so that oF = 0 and of = 0. 
ax ду 


(0,0) (0,0) 
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Geometric interpretation of partial derivatives. Consider a surface S 
specified by the equation z = f(x, y)in three—dimensional space where 
f(x, y) is a continuous function having partial derivatives on some domain 
D. We wish to interpret geometrically partial derivatives of f(x, y) at a point 
Mo(%, yo) € D, which corresponds to the point No(%, уо, f(x, yo)) on the 
surface z — f(x, y). 

On computing the partial derivative Le at Mo we think of z = 
f(x, y) as a function of one variable x and treat y as a constant y = yo, 
ie, z = f(x, yo) = fil). 

The function z = fi(x) specifies the curve-Z obtained by intersecting 
the surface S with the plane y — yo (Fig. 12.10). Recalling the geometric 





Fig. 12.10 


interpretation of ordinary derivatives, we can write /1(Х0) = tana where 
a is the angle between the x-axis and the tangent to the curve L at the 


point No. Since f1(xo) = (=) then (55) | = tan a. 
Ox 
(х, Yo) Mo 


Ox 
is the slope of the tangent at No to the curve 
Mo 





02 
Thus (= 





formed by intersecting the plane y = yo with the surface z = f(x, y). 
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Oz 


Analogously, (=) = tan 6. 





Mo 
Differentiable functions. Let a function z = f(x, y) be defined on some 
domain D in the xy-plane and let (x, y) be some point in D. We let Ax 
and Ay denote the increments in x and in y such that (x + Ax, y + Ay) € D. 
Definition. The function z = f(x, у) is said to be differentiable at 
(х, y) € D if the total increment 


Az = f(x + Ax, y + Ay) - Дх, у) 


corresponding to the increments Ax and Ay admits a representation of the 
form 


Az = A Ax + B Ay + a(Ax, Ay) Ax + B(Ax, Ay)Ay, (*) 


where A and B are independent of Ax and Ay (though they depend upon 
x and y in general) and a(Ax, Ay) and 8(Ax, Ay) tend to zero as Ax and 
Ay approach zero. 

If a function z — f(x, y) is differentiable at a point (x, y) the linear 
(relative to Ax and Ay) part A Ax + B Ay of its increment is called the 
differential of z — f(x, y) at the point (x, y). 

We denote the differential of a function z = f(x, y) by dz and write 


dz = A Ax + B Ay. 


Thus, Az = dz + а Ax + B Ay. 
Example. Find the differential of the function z = x? + y’. 
ч. Given any point (x, y) and any increments Ax and Ay there holds 


Az = (x + Ах)? + (у + Ay – х2 - y? 
= 2x Ax + 2y Ay + Ax Ax + Ay Ay. 


Here we have A = 2x, 8 = 2y, «(Ах, Ay) = Ax and (Ах, Ay) = Ay. Hence 
a and 8 tend to zero as Ax > 0 and Ay > 0. 

By the definition of differentiable functions the given function is 
differentiable at every point in the xy-plane and dz = 2x Ax + 2y Ay. ®> 

Notice that we have not excluded the case when either Ax or Ay or both 
are equal to zero. 

We can abbreviate the formula (*) by introducing the distance between 
the points (x, y) and (x + Ax, y + Ay) as 


e = У(Ах)? + (дуу. 


Then we can write 


o Ax + B Ay = («ao (о #0) 
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Or 
а Ax + B Ay = EQ, 


Ax A 
where £ = а 23 + 8-2 depends upon Ax and Ay and tends to zero as 


Ax э 0 and Ду > 0 or, using the abbreviated notation, as ọ > 0. 
Thus formula (*) expressing the condition for the function z = f(x, y) 
to be differentiable becomes 
Az=AAx+ BAy + EQ, 


where ғ = &(о) > 0 as о > 0. For instance, turning to the previous example, 
we have Az = 2x Ax + 2y Ay + (Ax)? + (Ау) = 2x Ax + 2y Ay + о? 
where ё(0) = о. 


Necessary conditions for a function to be differentiable. We shall prove 
the following important theorems. 

Theorem 12.4. If a function z = f(x, y) is differentiable at some point 
then f(x, y) is continuous at this point. 
ча Indeed, if z = f(x, y) is differentiable at a point (x, у) then the incre- 
ment in z at (x, y) corresponding to the increments Ax and Ay admits a 
representation of the form 


Az = А Ах + B Ay + а Ах + OG Ay, 


where A and B are constant at (x, y) and а > 0 and 8 > 0 as Ax > 0 and 
Ay — 0, whence lim Az = 0. This means that the function z = f(x, y) is 
$28 
continuous at the point (x, y). » 
Theorem 12.5. If a function z = f(x, y) is differentiable at a point then 


f(x, y) has partial derivatives A and E at this point. 


ча Indeed, let z = f(x, y) be differentiable at a point (x, y). Then the incre- 
ment Az corresponding to the increments Ax and Ay can be written as 
Az = А Ax + B Ay + a(Ax, Ay)Ax + B(Ax, -Ay)Ay. 
Put Ax ~ 0 and Ay = 0. Then the above formula gives 


Az = A Ax + a(Ax, 0)Ax. 
Whence 


А; 2 E 
Ax A + a(Ax, 0). 


Since the value of A is independent of Ax and a(Ax, 0) > 0 as Ax > 0 then 
Р А; 2 
А = 1 : 
гэ Ах 


Ax0 
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This implies that at the point (x, y) there exists a partial derivative of 
z= f(x, y) with respect to x and 


9: 


дх 

By similar reasoning we infer that at the point (x, у) there exists a partial 
derivative of z — f(x, y) with respect to y and z -8 » 

It is worth mentioning here that Theorem 12.5 states that there exist 
partial derivatives at the point (x, y); however, it tells nothing about con- 
tinuity and the behaviour of the partial derivatives at (x, y) and in a neigh- 
bourhood of the point (x, y). 

From Theorem 12.5 it follows that 


_ 94 дг 


Sufficient conditions for a function to be differentiable. Recall that for 
a function y — f(x) of one variable to be differentiable at the point xo it 
is necessary and sufficient that y — f(x) have a derivative f'(x) at xo. 
However we can not set forth similar necessary and sufficient conditions 
even for a function z — f(x, y) of two variables; the sufficient conditions 
exist apart from the necessary conditions discussed above. The former are 
given by the following theorem. 

Theorem 12.6. Let a function z — f(x, y) have partial derivatives f; and 
J, in some neighbourhood of a point (xo, yo) and let f; and /, be continuous 
at (Xo, yo). Then z = f(x, y) is differentiable at (xo, yo). 

The proof can be found elsewhere in the relevant literature. 

Example. Investigate the function f(x, y) = Уху : 
-4 This function is defined everywhere. By the definition of the partial 
derivative we have 








M ju 
, qus ЛАх, 0) – 0, 0). МАх-0 -0 — 
Oe a 
and 
Зато 
, _ 4, ЛО, Ду) – ЛО, 0)  , VO-Ay -0 _ 
Deg. = yr жш 


To define whether f(x, y) is differentiable or not at the point O(0, 0) 
we compute the increment of f(x, y) at O(0, 0); we have 


А/О, 0) = VAx Ay = e(Ax, Ay)e. 


544 12. Functions of Severa! Variables 


Since o = V (Ax)? + (Ay? then 
у Ax Ay 

N (Ax! (Ay 

For the function f(x, y) to be differentiable at O(0, 0) it is necessary 
that the function e(Ax, Ay) be an infinitesimal as Ax ^ 0 and Ay > 0. Put- 
ting Ay = Ax > 0, we can write e(Ax, Ay) as 
(Ax? 
Ax. 

: 3 
It is easy to see that є (Ax, Ay) ^ œ as Ax > 0 so that f(x, y) = Уху 


is not differentiable at O(0, 0) although f(x, y) has partial derivatives f; 
and f, at O(0, 0). This result is attributed to the discontinuities of f; and 


Ју at O(0, 0). » 


Total differential and partial differentials. If a function z — f(x, y) is 
differentiable then the total differential is given as 


dz = А Ах + B ôy. 





(Ах, Ay) = 


(Ах, Ay) = 





Noting that A = ee and B= 28, we can write dz as 
дх ду 
ШИ 
dz = dx Ax + p Ay. 


To extend the notion of the differential we put differentials of indepen- 
dent variables equal to their increments so that 


dx = Ax and dy = Ay. 
Then the total differential of f(x, y) can be written in the form 


2 
dz = — dx + a dy. 
Example. Compute the differential of the function z = In(x + y’). 
ча We have 


2y _ zu А 


2 » 
x+y x+y? 








dz = l ; dx + 
x+y 


Analogously, if и = f(xi, Хо, ..., Xa) is а differentiable function of n 
independent variables then 
n 


= >) Č as (ах = Ax). 
zai OXk 


12.4 Derivatives of Composite Functions 545 


The relation 
dz = f(x, y) dx 
is called the partial differential of a function z — f(x, y) with respect to 
the variable x, and 
dyz = fj (х, y) dy 


is called the partial differential of z — f(x, y) with respect to the variable y. 
Using the above formulas for dz, dz and d,z, we can write the differen- 
tial of z — f(x, y) as 


dz = dyz + dyz, 


ie, the total differential of z — f(x, y) is the sum of partial differentials. 
Notice that in general the total increment Az in z — f(x, y) is not equal 
to the sum of the increments in x and y. 

Suppose that a function z — f(x, y) is differentiable at the point 
(x, y) and dz z 0 at (x, y). Then the total increment 


92 92 


A => — ааа Бая 
z=- ®t ày Ay + a(Ax, Ay)Ax + B(Ax, Ay)Ay 
differs from the linear part 
д2 д: 


only by the sum о Ax + В Ay where Ax and Ay as Ax > 0, Ay > 0 are in- 
finitesimals of higher orders than the summands in dz; so if dz ~ 0 the 
linear part of the total increment of a differentiable function is called the 
principal part of this increment and the approximation 


Az = dz 


is widely used. Notice that the smaller the absolute values of the increments 
in arguments the higher the precision of the approximation. 


12.4 Derivatives of Composite Functions 


Let a function z — f(x, y) be defined on some domain D in the 
xy-plane and let both x and y be functions of a variable / so that 


x= e(t), у = (f), to c t« th. 


We assume that for any point / in (fo, 44) the corresponding point (x, y) 
is contained in D. Then the substitution x = (f) and y = y(t) reduces 
z = f(x, y) to the composite function z = f[e(t), 4(0| of one variable t. 


35—9505 
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Theorem 12.7. If at a point t there exist the derivatives 


& - e'(t) and 9. = yo 


and a function f(x, y) is p given x = e(t) and y = y(t) then 
the composite function z = fle(t), WO] has a derivative = at t and 
dz Oz dx 9 dy 


dt ах dt ду dt^ 





ча Let At be an increment in £ which gives rise to the increments Ax and 
Ay in x and y and, hence, to the increment Az in the function z. Recall 
that 4 = f(x, у) is a differentiable function at (x, у). Then if 
(Дх)? + (Ay)? x 0 the increment Az can be written in the form 


Az = бе Ах 3 Ay + a dx + Boy, 
where a(Ax, Ay) and б(Ах, Ay) tend to zero as Ax > 0 and Ay 0. If 
we put a(0, 0) = 0 and 6(0, 0) = 0 for Ax = Ay = 0 then a(Ax, Ay) and 
B(Ax, Ay) will be continuous as Ax = Ay = 0. 
Consider 
Az _ д: Ах , дг Ay 
At ax М ду At 





Ax Ay 
ta АТ + 8 “АГ? (*) 
Both factors of each summand on the right have limits as At > 0. In- 


: MC 9 9 : . 
deed, the partial derivatives 23 ап4 S are constant at the given point 


дх ду 
(x, у). By the hypothesis there exist limits 
Ax dx Ay 
Aur ДО арг улаа m АС ар VD: 
4х dy . : : 
Since dr and cae exist at a point / the functions x = (t) and y = ү(/) 


are continuous at this point. This implies that as At > 0 both Ax and Ay 
tend to zero and, hence, so do a(Ax, Ay) and B(Ax, Ay). 

Thus the right-hand side of (*) has a limit as At — 0 equal to 
9c 106 2:200 99 Tte means that the Ieftchamd ade ot (0) also has 
дх dt ду dt Az d 


a limit as At > 0 so that there exists lim —— = ——. Evaluating the limits 
At0 At dt 





in (*) as At > 0, we obtain 
dz Oz dx дс dy 


di ox йш. ду dt’ (n) 
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Example. Compute the derivative of the function z = x^ + y? where 
= sint and y = ѓ?. 
ча Using the above formula, we get 
dz 
dt 


Let us consider a function z = f(x, y) where у = y(x), z being a compo- 
site function of x so that z = f(x, ¥(x)). Then 


dz Oz 9: dz 


"dx ~ 8x | ду dx’ 


= 2x cost + 2y 302 = 2sint cost + 6 = sin2t + 65. » 


az. : NP : 
where 56 is a partial derivative of z = f(x, y) with respect to х computed 


2 шил МОЕ © 
by regarding the variable y as а constant. The derivative de is the zotal 


derivative of z = f(x, y) with respect to the independent variable x comput- 
ed by regarding y as a function of x, ie, у = y(x) so that the relationship 
between z and x is taken into account. 





oz dz шан 13 
Example. Compute EI and E provided that z — tan = and 

у = ж. 

az a -17 y д2 х 
e = Ер - 

ax  ðx (tan >) ety? ду x+y 

dz 94, д: dy y x 1 
and —— = — + — —— = -r + SO N = — 

dx дх dy dx x+y ху 14+ x 


Now we draw our attention to differentiation of a composite function 
of several variables. Consider the function z = f(x, y) where x = e(£, n) 


and y = ¥(€, п) so that z = z(£, n) = f(e(En), V(£, n)). Let continuous par- 
tial derivatives es id ду 

дЕ’ On’ Ot 
f(x, y) be differentiable at the corresponding point (x, y) where x = e(£, n) 
and y = v(£, n). 


ыг Би that the composite function z = z(£, 7) has the derivatives i Ё 


and x at the point (£, 7) and derive the respective formulas. Notice that 


this case is almost similar to that considered before. Indeed, on differentiat- 
ing z with respect to £ the variable y is regarded as a constant; this implies 
that x and y become functions of one variable £, ie, x = e(£, c) and 
у = V(£, c) so that formula (**) is fully applicable here. 

Using (**), we obtain 


Oz Oz Ox Oz ду 


Ot Әх OE” ду OE 


д , 
and r1 exist at a point (£, 7) and let 


35* 
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Analogously 
94 д Ox 9: ду 


u.a Oe Ua ау 


Example. Compute the partial derivatives = and E of the function 
£ 


z = х?у — xy? where x = £y and y = 5. 
7 








a _ д@ дх az ду _ ET 55 i 
= ð x д F dy ð = (2xy – yn + (x 2ху) = 
2 Ё E) (er D = ( y 
2 ~ 2 c Ру хез —— 
ЄТ т n+ (En 267 = 38 (s : 
and 


az _ д: дх az ðy E jer К: 
ws caus ce co WEM So r3 


2 
= (245 - Ee (ет - 202) (- 5) - 660 2). > 
7 7 7 7 n 


If a composite function u = f(x, y, 2) is given by x= x(E, n), 
y = у(&, п) and z = z(£, т) so that и = u(£, т) then 


ди ди дх | ди ду г ди Oz 


8E Әх OL ду àt Oz ot 








and 





ди _ ди ôx , ди ду , ди 92 
an — дх ð ду an 92: дт’ 


In particular, if и = f(x, у, 2) where z = z(x, у) then 


ðu af Әу az ðu af |f az 
æ ax | дт Ox and ay SDS az ду” 





ди . : M ; : 
Here ax sa total partial derivative of 4 with respect to tbe indepen- 
Р | : : of . 
dent variable x allowing for the relationship between u and x, А 15 а 


partial derivative of u = f(x, у, 2) with respect to x computed by regarding 
: ди д ЭР? 
the variables y and z as constants. The meanings of y and B are similar 


ди of : 
to those of E and Эу” respectively. 
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Differentials of composite functions. If z — f(x, y) is a differentiable 
function of independent variables x and y then the total differential dz of 
z is given by 

dg- aug © а (9 
7 x ay Y» 
where dx = Ax and dy = Ay. 

Now we suppose that z = f(x, y) is a composite function. Assume that 
X = e(£, т) and y = v(£, л) and that these functions have continuous par- 
tial derivatives with respect to £ and у at a point (£, з). Let there exist 
continuous partial derivatives = and Ён at the point (x, y) corresponding 
to (£, n) so that z = f(x, y) is differentiable at (x, y). Then the function 
z = fle(£, n), V(£, »)] has at the point (£, з) the derivatives 


az Oz Ox 9: ду 


98 ax Of ду 8t 





and 
д2 Oz Ox 9: ду 


an ~ ax др ду дт” 





à 
It is easy to see that and e are continuous at the the point 


3E 
(&, 7). Hence, the function z = f[e(£, n), v(£, »)] is differentiable at 
(£, n) and by virtue of the formula for the total differential of independent 
variables 1) ис £ we have 

92 


dz = 3b 3 dy. 


ЭЛ : 9 
Substituting the expressions for + and а into the above formula, 


we get 


(9x ax 9: ду dz дх Oz ду 
acc (5 aE Әу зг )а + (ж óy ^ ду an 


_ 94 ( дх Ox 92 ду 
4 - Ax dits а) +5 L4 2 ал). 

By the hypothesis the functions x = e(£, л) and y = (5, з) have con- 
tinuous partial derivatives at the point (£, 7). Then they are differentiable 


at ($, т) and 








Or 








dx = —— dts ay and dy = 


Р ав--1- ; dn. 


Зр 
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From the above formulas for dz and dx it easily follows that 


д2 0z 
dz = E dx + dy dy. 

Comparing the above formula with (*) we infer that the total differential 
of the function z = f(x, y) is expressed by formulas of the same form when 
the variables x and y are independent and when x and y are functions of 
some other variables. Thus the form of the expression for the total differen- 
tial of a function of two and more variables remains the same (invariant) 
in both these cases. 

Remark. Based upon the previous results it is easy to verify that the 
differentiation formulas 


d(x + y) = dx x dy, 
d(xy) = xdy + y dx, 


a(*) 5 DE | 


remain valid for x and y being differentiable functions of any finite number 
of variables, ie, for x = e(£, n, 6, ...) and у = ұ(&, n, ё, ...). 


12.5 Implicit Functions 


Consider the equation F(x, y) = 0 where F(x, y) is a function of 
two variables defined on the domain G in the xy-plane. We suppose that 
for every x in some interval (xo — ho, хо + ho) there exists the only value 
of y which satisfies the equation F(x, y) = 0; clearly in this case the func- 
tion y = у(х) such that the identity F(x, у(х)) = 0 is fulfilled for all x in 
(хо — Ао, Xo + h) is fully determined. Then we say that the equation 
F(x, y) = 0 specifies the quantity y as an implicit function of x. 

In other words, the function y = y(x) specified by the equation 
F(x, y) = 0 involving y is called the implicit function. To express the rela- 
tionship between y and x explicitly we have to solve the original equation 
with respect to y. 

Examples. (1) The equation y — x — O specifies y as a single-valued 
function y = x of x at every point of the x-axis. 

(2) The equation y — x — e sin y = 0,0 < є < 1, specifies y as a single- 
valued function of x. 
ча To verify this we investigate the original equation y- x- 
є sin y = 0. Notice that this equation is satisfied if x = 0 and y = 0. We 
regard x as a parameter and consider the functions = = у and 
z= x + ESin y. It is clear that if for a given xo there exists the only value 
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yo such that the point (xo, yo) satisfies the original equation, then the line 
z = y and the curve z = xo + є sin y intersect at a unique point and vice 
versa. Let us draw the graphs of z = y and z = хо + € sin y in the zy-plane 
(Fig. 12.11). 

The graph of z = x + e sin y where x is thought of as a parameter is 
obtained by translating the graph of the curve z = є sin y along the z-axis. 
It is easy to see that the graphs of z = y and z = x + e sin y meet at the 
only point whose ordinate y is a function of x implicitly specified by the 
equation y — x — € sin y = 0, 0 < є < 1. This relationship can not be ex- 
pressed in elementary functions. 9» 


хү 







Z-X.*tESin gy 


Z=x9+E SİN y 
Xo 


z=csiny 


Fig. 12.11 


(3) The equation x? + y? + 1 = 0 does not specify y as a real-valued 
function of x whatever is a real value of x. 

Analogously we can introduce the notion of an implicit function of 
several variables. 

The following theorem specifies the sufficient conditions for the equa- 
tion F(x, y) = 0 to be solvable for y in some neighbourhood of a given 
point хо. 

Theorem 12.8. Let there be given an equation F(x, y) = 0 and let the 
following conditions be satisfied: 

(i) the function F(x, y) is defined and continuous in the rectangle 
Dí(xo— 061 < x € xo + ài, yo-&<y<yt à), & > 0, ё > 0, with 
centre at (xo, Yo); 
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(ii) the function F(x, y) vanishes at the point (xo, yo) so that 
F(%, yo) = 0; 
OF 


529 Р : : m дЕ : 
(iii) there exist continuous partial derivatives “эр апа y in D; 


(ivy SFO, Yo) == yo) до, 


Then given any sufficiently small positive number є there exists a neigh- 
bourhood хо — бо < Xo < Xo + ĉo Of Xo, о > 0, where there exists a unique- 
ly defined ® continuous function y = f(x) (Fig. 12.12) such that yo = f(xo), 
|» — yo| < £ and the equation F(x, f(x)) = 0 becomes an identity, їе. 

F(x, f(x)) = 0 


for ail x in the given neighbourhood of xo. 





Fig. 12.12 


The function y = f(x) is continuously differentiable in the neighbour- 
hood of Xo and 


OF 

dy Эх aF 

бу. SO, f ТҮР 

dx ЭР (5; ) (9) 
y 


Э We regard у = f(x) as "uniquely defined" in the sense that the coordinates of any 
point which lies on the curve F(x, y) = 0 and is contained in the neighbourhood 
Q = (xo — 6o < X € Xo + 5p, Yo — € € у € Yo + E] of the point (xo, yo) are related by the 
equation y = f(x). 
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ча We deduce (+) for an arbitrary implicit function assuming that the 


7; 4 : 
derivative e exists. 
dx 


Let y = f(x) be a continuous implicit function given by the equation 

F(x, y) = 0. Then for any x in (xo — бо, xo + ёо) there holds the identity 

dF(x, "PEE 

F(x, f(x)) = 0; hence PU Se) = 0 for any point in this neighbourhood. 
By the chain rule of differentiating a composite function we have 


dF(x, y(x)) | OF | OF dy 





dx дх ду ах: 
Therefore for y = f(x) we get 
Sr OE. D. 
ox ду dx 
Whence 
dy OF /ax 
dx AF /ay ` 


Example. Compute 2 of the function y = y(x) defined by the equa- 
tion х2 + y? = R?, 


4 Here we have F(x, y) = x? + у? – R? and ын = 2x and X = 2y. 
Using (*), we obtain 2 == y (у #0). > 


Remark. Theorem 12.8 specifies the sufficient conditions for an implicit 
function whose graph passes through a given point (Хо, yo) to exist. However 
it tells nothing about the necessary conditions. Indeed, consider the equa- 
tion (y — х)? = 0. Here the function F(x, у) = (y — xf? has continuous 
partial derivatives Ру and Ру and Ру = 2(y — x) is equal to zero at the point 
O(0, 0); nevertheless the equation F(x, y) = (у — х)? has the unique solu- 
tion y = x vanishing at x = 0. 

Problem. Consider the equation y? = х2. Let y = у(х), -œ < x < 
+ оо, be а single-valued function satisfying y? = x’. 

Find out how many (1) single-valued functions of the form y = у(х), 
— o « x < + c; (2) single-valued continuous functions; (3) single-valued 
differentiable functions; (4) single-valued continuous functions y — y(x), 
1-8<x< 1+, if y(1) = 1 and 6 > 0 is sufficiently small, satisfy the 
equation у? = x°. 

Similarly to Theorem 12.8 we can formulate a theorem for an implicit 
function z= (х, у) of two variables specified by the equation 
F(x, у, 2) = 0. 
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Theorem 12.9. Let there be given an equation F(x, y, z) = 0 and let 
the following conditions be satisfied: 

(i) the function F(x, y, z) is defined and continuous on the domain 
D such that 


Xo — ôi < X € Xo + бу, 
D = {yo - о <у<у + бу, 
Zo — з < Z < 20 + бз, 


where бу > 0, 62 > 0 and ёз > 0; 

(1) F(xo, Yo, zo) = 0; 

(ili) there exist continuous partial derivatives F,, F; and Fz in D; 

(iv) FQ, Yo, zo) # 0. 

Then given any sufficiently small є > 0 there exists a neighbourhood 
Q of the point (xo, yo) where there exists a uniquely defined continuous 
function z = f(x, y) such that z = f(x, y) attains the value zo at x = xo 
and y = уо, |Z — %| < є and the equation F(x, у, 2) = 0 becomes an identi- 
ty, ie, F(x, y, f(x, y)) 0 for all (x, у) in Q. The function z = f(x, y) 
has continuous partial derivatives f; and f; in Q. 

Let us deduce formulas for f; and fy. Suppose that the equation 
F(x, y, 5) = 0 specifies z as a single-valued differentiable function 
z — f(x, y) of independent variables x and y. If we substitute f(x, y) for 
z then F(x, y, z) = 0 becomes the identity 


F(x, y, f(x, у) = 0 (x, у)60. 


Hence, the partial derivatives of F(x, y, z) where z — f(x, y) with respect 
to x and y must also be equal to zero. On differentiating, we get 


ЭР | дЕ да _ aug OF , OF дс _ 
дх дт Ox — ду дс ду. 

Whence 
дд _ __дЕдх nd аЕ/ду ( ӘЕ o 
ax OF/az ðy  8OF/oz NOR ` 


These are the formulas for partial derivatives of an implicit function 
of two independent variables. 

Example. Compute the partial derivatives of a function z(x, y) given 
by the equation F(x, y, 2) = х? + y? + 22 – R = 0. 


дЕ дЕ дЕ д х 
«4 — — ——— ИШЕ ——= 5 == 
We have Эх 2х, ду 2y and 3z 2z; whence Эх > апа 
az у 
LE LA 0). > 
ду z e 


To verify the results we can find an explicit formula for the function 
and apply standard formulas for partial derivatives directly. 
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12.6 Tangent Planes and Normal Lines to a Surface 


Let 5 be a surface given by the equation F(x, y, z) = 0. 
Definition. The point M(x, y, z) on a surface S is called the regular 
. D F F 
(nonsingular) point of S if all the three partial derivatives - ES 


дЕ : : Мин 
and ae at M exist and are continuous and at least one of them is distinct 


from zero. 
The point M(x, y, z) is called the singular point of S if all the three 
partial derivatives д 3 oF and oF vanish at M or at least one of them 
ox” ду 94 
does пої exist at М. 





by. 


Fig. 12.13 Fig. 12.14 


Example. Consider a circular cone given by the equation x? + y? — 
z? = 0 (Fig. 12.13). 
дЕ 


*4 Here we have F(x, у, 2) = x? + y? — z? so that ын = 2x, "my 2y 
OF 
and —— = —2z. The only singular point is the origin of coordinates 


д2 
ООО, 0, 0) where all the three partial derivatives vanish. » 
Let L be a space curve given by the parametric equations 


x= et), 
у= (0), a«t« 8. 
z = w(t), 
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Suppose that e(t), V(t) and w(t) have continuous derivatives до (t), Y’ (t) 
and w’(t) at every ¢ such that a < < 8. We leave aside singular points 
of L where e'?(t, + y’? (0 + w’? (t) = 0 and consider a regular 
point Мо(хо, Yo, zo) on L specified by the value fo of /, to € (a, 8). Then 
the vector т = x'(fo)i + y'(fo)j + 2’ (%)k lies on the tangent to the curve 


L at the point Mo(Xo, Yo, 20). 
Now we choose a regular point P of a surface 5 and draw through Р 


a curve L lying in S. Let the curve be given, as before, by the parametric 
equations 


x = e(t), 
у= (0, a<t<8 
z = w(t), 


and let e(t), (0) and w(t) have continuous derivatives that nowhere in с(8) 
vanish simultaneously. 

By the definition the tangent to L at P is called the tangent to the sur- 
face S at P. 

If the above parametric equations are substituted into the equation 
F(x, y, z) = 0 of the surface S the latter becomes an identity with respect 
to Г so that F(e(t), Y(t), w(t)) = 0 since the curve L lies in the surface S. 

Differentiating this identity as a composite function of f, we get 


OF dx " OF dy OF dz 

ах dt ду dt | az dt 
The expression on the left is a scalar product of the vectors 
9Е, дЕ. OF dx dy dz 


n= ae age a; к and += dp it at di k. 


The vector 7 is a tangent vector to the curve L at the point P. The vector 
n being independent of the shape of the curve passing through the point 
P depends only on the coordinates of P and on the shape of the function 
F(x, y, z). 

Since P is a regular point the length of m 


н б СУ 
ox ду Oz 
is distinct from zero. 

The identity (ж) implies that (n, т) = 0. This means that the vector т 
tangent to the curve L at the point P is perpendicular to the vector n at 
P (Fig. 12.14). The same reasoning is fully applicable to any curve lying 
in the surface S and passing through the point P. Hence, any tangent to 
the surface S at the point P is perpendicular to the vector n and thus all 
such tangents lie in the same plane perpendicular to the vector n. 


= 0. (*) 
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Definition. The plane formed by all tangents to a surface S through 

a given regular point P € S is called the fangent plane to the surface at P. 

_ { OF OF дЕ 
Б дх 


р oy д2 
the tangent plane F(x, у, 2) = Oat the point P. Whence we get the equation 
of the tangent plane to the surface F(x, y, z) = 0 at the regular point 
Po(Xo, Yo, Zo) as 


дЕ дЕ дЕ 
Gr) A ыг (5) шаг? эн (5) 


If the surface S is given by the equation z — f(x, y) we can write it 
_ oe OF Of OF _ af OF —— 
as F = f(x, y) — z = 0; whence or ae oy a and am l. 


Then the equation zo — f(xo, yo) of the tangent plane at the point 


Po(Xo, Хо» Zo) becomes 
of 
(x Xo) Ж (2 ) 


e) 


Geometric interpretation of the total differential. The substitution 
X — xo = Ax and y — yo = Ay reduces the above equation to 


_(a af 
ice (80,875 (5) 


The expression on the right is the total differential of the function 
z = f(x, y) at the point Mo(%, yo) in the xy-plane so that z — zo = dz. 

Therefore the total differential of the function z = f(x, y) of two in- 
dependent variables x and y at the point Mo corresponding to the incre- 
ments Ax and Ay is equal to the increment z — zo of the applicate z of 
the tangent plane to the surface at the point Po(xo, yo, /(xo, yo)) obtained 
in moving from Мо(хо, yo) to M(xo + Ax, yo + Ay). 

Definition. The line that passes through the point Po(xo, уо, 20) of the 
surface F(x, y, z) = 0 perpendicularly to the tangent plane to this surface 
at Ро is called the normal to the surface at Po. 

: : OF OF дЕ : 
It is easy to notice that the vector n = { ——, ——, —— is the 
( ax’ Oy az | | 
direction vector of the normal given by the equations 


The vector n 

















] is a normal vector of 


, 
P P. 


(z — 20) -0. 
(Xo, Yo, 2o) 














(у — Yo). 
(X0. 0) 








(х,у) 


Ау. 
(х,у) 








X — Xo = У — Yo = 2- % 


дх (хо, Yo, Zo) ду (Xo, Jo, Zo) дг 


(хо, o, 20) 
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If the surface S is specified by the equation z = f(x, y) the equations 
of the normal at the point Po(%, Yo, zo) become 
X — Xo E y-» . 2 — 20 ! 


CONES NES 
9х / | qs, y) ду / Ту) 


Example. Write down the equations of the tangent plane and the normal 
to the surface z = x? + y? at the point O(0, 0, 0). 
ча We have f(x, y) = х? + y? so that = = 2x and B — 2y; these 
derivatives vanish at the point (0, 0), i.e., /,(0, 0) = fj(0, 0) = 0. Then the 
equation of the tangent plane becomes 


z — 0 = 0(х – 0) + 00 – 0), 


ie, z = 0 (the tangent plane is the xy-plane). 
The equations of the normal are 





0 0 1 


х-0 y-0 z-0 -0 
y=0 


i.e., the normal is the z-axis. В» 


12.7 Derivatives and Differentials of Higher Orders 


Let a function z = f(x, y) have partial derivatives 38 ana Я ы 


E р — д 
every point in: some domain D. Clearly, these derivatives 32 = fix, y) 
and $ = fy(x, у) are functions of x and y in D; they can have deriva- 


tives at some or all points of D. 

The partial derivatives of A and =, if they exist, are called the se- 
cond derivatives of the function z = f(x, y) or the partial derivatives of 
the second order. Given a function z = f(x, y) of two independent variables 


х and y we can write partial derivatives of the second order as 
а (әү да _ py а (o. Pu _„ 
дх\дх/ a ^" à \ Ox дудх 25 


ә (B) д _„ ә (2) Nx 6 
“ax 1Хду) дхду | ay \ ay ay vU 
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The derivatives fy, and fy, are called the mixed partial derivatives; the 
former is computed by differentiating the given function first with respect 
to x and then with respect to y and the latter is computed by differentiating 
the function first with respect to y and then to x. 

The partial derivatives of the third and higher orders can be defined 
in a similar way. 

Example. Compute the first and second derivatives of the function 
z= ху? — xy. 











ча We have 
ðZ 42 2. 13 02.5 3 Е 2 
dx = X» LU aa 3ху”, 
2 2 
ы = ху?, 2 = 2x? – бху, 
ox ду 
д°@ 2 2 az 2 2 
————— = — = — 2 р 
ду дх SEO ax ду кезу 


Notice that the mixed partial derivatives z,, and zy, are identically equal. 
The explanation of this result is given by the following theorem. 

Theorem 12.10. Let a function z = f(x, y) have partial derivatives fy, 
Jy, fy, and fy, in some neighbourhood of a point Мо(хо, yo) and let fy, 
and fy, be continuous at Mo(%, yo). 

Then foxo, Yo) = Ју (0, Yo) at Mo. 

It is important that the mixed partial derivatives f;, and fy, be continu- 
ous at Мо(хо, yo). For example, the function 
2 


2) 
5 E s x! + у? z 0, 
Дх, у) = bid 
, 0, х=у=0 


have mixed partial derivatives f;, and fj, which are not continuous at the 
point O(0, 0) so that /5(0, 0) = —1 and /,;(0, 0) = 1. 

In general, the mth mixed partial derivatives (т > 2) of a function 
и = f(x, 0, ..., Xn) have the same value at a given point if they are con- 
tinuous at this point and are distinct from each other by the order in which 
the function is differentiated with respect to its variables. 


Differentials of higher orders. Let z — f(x, y) be a function of variables 
x and y defined on the domain D. 

If z = f(x, y) is differentiable on the domain D then the total differen- 
tial of z = f(x, у) at the point (x, у) € D corresponding to the increments 
dx and dy in the variables x and y is given by the formula 
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where dx = Ax and dy = Ay are arbitrary increments in independent vari- 
ables, i.e., these are arbitrary numbers independent of х and y. So we can 
vary x and y keeping dx and dy constant. Given dx and dy the total differen- 
tial dz is a function of x and y, which can also be differentiable. 

Definition. The total differential of dz at a point (x, y) corresponding 
to the increments of independent variables equal to dx and dy is called 
the second differential or the differential of the second order of a function 
z = f(x, y) and is denoted by the symbol dz so that d?z = d(dz). 

Let a function z = f(x, y) have continuous partial derivatives of the 
first and second orders in the domain D. Then the total differential dz of 
z = f(x, y) is a differentiable function so that in D there exists 422. 

Recall that dx and dy are constant. Then using the laws of differentia- 
tion, we obtain 


Oz Oz az 9: 
d* = J; ——— ——— = — ——— 
z=d(dz) 45 dx + ày ay) (x ax) + ($ ay) 
= = (= 5) а + (S dy. 
ay Y. 


Applying the formula for the total differential to 3 and 3 we get 


az\ _ д (д: a (дс _ az ә 
a(z) = x (ae) + m (R)e- 9х2 ™ + ayax ©” 


az a (д: a (az az az 
d — = —— —— — ———— = 
(55) бах © a7 ду? dy, 


Substituting these relations into the formula for цар we arrive at 














2 
dz = 32 (dx)? + 





yd dx dy + ———— 


az 9 
дудх Әдхду 
ous functions we улыг obtain 

az 
z= 2 
$ 2 rd lY ay 


where dx? = (ах)? and dy? = (ау). 
It is convenient to write the above formula by applying the notation 


Since and both mixed partial derivatives are continu- 





a 


д : 
Be dx + dy dy; then we can write 


д à 2 
2, 5 
do (gar “ХЭВ 
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The symbols and * are regarded as some “multipliers” so that by 


9 
x 
completing the square as usually done in algebra, we arrive at the desired 
result. By way of illustration we write 











a a 2 g д? 8” 
2 = 2 2 
d'- (3 dx By dy) 32 dx’ +2 ax ду dx dy+ ay? dy. 


Multiplying both sides termwise by z and inserting z into the nominators 
of the “fractions” on the right, we get the formula previously deduced, 
namely 
3 8": 922 
$ E TO Ur ay? 

Formulas for differentials of the third, fourth, etc., orders can be derived 
in a similar way. In general, the total differential of the mth order denoted 
by d"z is the total differential of the total (n — 1)th differential so that 
d"z = d(d"~'z). 

If z = f(x, y) € C"(D) has continuous partial derivatives up to the nth 
order derivatives then there exists the total differential of the mth order 
given by the formula 


E As) д n 


The total nth differential of a function и = f(xi, Xo, ..., Xm) of m inde- 


2 





pendent variables xi, Хо, ..., Xm is given by the formula 
C (9. à д " 


provided that the HN ym are satisfied. 

Remark. If x and y are not independent variables but some functions 
£ and q then the form of the second differential does not remain invariant. 
ча Indeed, let z = f(x, y) where x = e(£, n) and y = (Ё, n); then the first 
differential can be written as 


di = dx + ŽE dy, 


where dx and dy are some functions and, hence, they cannot be constant. So 


2, _ д E 
dz = а(5у )а «(S SAE ax d(dx) + ày d(dy) 
_ 9 9 a » a 2 
= (ape) zt ax+ tay. d^y. 
Thus, the form of the second differential is not invariant in this case. 9» 


36— 9505 
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12.8 Taylor's Theorem 


Let a function z — f(x, y) have continuous partial derivatives up 
to the nth derivatives at every point (x, у) in some 6-neighbourhood of 
the point (xo, yo) and let a point (xo + Ax, yo + Ay) be contained in the 
ó-neighbourhood of (xo, yo) (Fig. 12.15). We put 


X= ж + tAx, 
у= yo + ty, 
where ¢ is some independent variable. Then 


z = f(x, y) = Хо + ГАХ, yo + t Ay) = e(t) 


(Xo*4X.Uo*49) 


(00) 





Fig. 12.15 


becomes a composite function of ¢ defined оп the closed interval [0, 1] 
and having derivatives up to the nth order on [0, 1]. This means that the 
function z = (t) can be represented by Taylor's formula in powers of f 
so that 


e'(0) e"(0) 2 e" 70) a-i 

















e(t) = 200) + Т. t+ 7! dx cR 4 
e? (6t) їл 
n! 4 
where 0 « 0 < 1. 
Put ¢ = 1. Then 
¢ (0) e" (0) e - 200) e 0) 


where 0 « 0 « 1. 
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We shall express the right-hand side of the above formula in terms of 
the functions f(x, y) and its derivatives. Notice that being the functions 
of t the variables x and y of f(x, y) have constant differentials dx = Ax dt 
and dy = Ay dt where Ax and Ay are some given numbers. This means that 
we can compute differentials of the functions z — f(x, y) by using the 
formula 


gis ахь 2 qq posi (їхайн do) 
ax ay 2 дх ду 


9 


p 
ay A) fe Dar, 


x fle, 9) = (ж Ax + 
so that 





d?z _ (р) a д д Р 
d? P 0 (6 В Ss Ay) fts. ». 


Put ¢ = 0. Then x = xo + f Ax = xo and y = yo + t Ay = yo. The above 
formula yields 





o (9 д Р Е = 
e = (ax “ду. Ay) fes. » xz% P=O, E В l. 
We put f = 0. Then 
д д n 
e (0) = e Ax + — y Ay) fo у) [ел 
у = yo + 8Ay 





Also notice that e(l) = /(хо + Ax, yo + Ay). 
Substituting the relations for (Р) (0), Ф“) (0) and ¢(1) into (+), we get 


SQ + Ax, yo+ Ay) = 00, yo) + (x Ax+ + ay) fee y) 





1 д 
+ —=— Эг (ge are эг ®) ® » 





Ус» 


1 9 9 m 
t. wax Ax + dy ay) Јо, у) 





+ — 1 a CR ant уу 4%) Л » 





= хо + 0Ax* 
= yo + бДу 


х 
y 
where 0 « 6 < 1. 


36* 
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This is Taylor's formula for a function z = f(x, y) of two variables and 


X = Xo + бАХ 
y = yo + 6Ay 


1/8 
Rn = QR 3x ах 0л Хх, у) 





is the remainder as given by Lagrange. 
To abbreviate the notation it is often convenient to denote f(x + Ax, 
Yo + Ay) — f(x, уо) by Af | os». Then Taylor's formula becomes 


I 
Af | вм) = df | (v.90) + эг d^f | оо») 


1 л-1 1 n 





(xo + 0Ax, yo + ӨДу) 


This formula is frequently used' as the approximation to the increment Af 
of the function z — f(x, y) at the point Mo(xo, yo). 

If the absolute values of Ax and Ay are sufficiently small and df z 0 
the differential df can be regarded as the approximation to Af so that 
Af = df. In this case the above formula involves only one term. When this 
approximation seems to be inadequate the desired precision is attainable 
by computing some other terms on the right. 

Example. Expand the function f(x, y) = е* sin y by applying Maclau- 
rin's formula with the remainder of order three. 
ча Taylor's formula with the remainder R; takes the form 


Јо + Ax, yo + Ay) = fo, Yo) + Л (Xo, yo) Ax + fj (Xo, yo) Ay 


4 DN WAX + 2/7, 0x, Yo)Ax Ay + fy, Xo, =з? 


+з m (x, ydx? 3f, уудх Ay + 3/2,(х, уулхАу? 





+ 45,06 »a | 


Putting % = yo = 0, Ax = x and Ay = y, we get Maclaurin's formula 
f(x, у) = ХО, 0) + f0, 0)х + FO, 0)у 


+ = [zo Ox? + 2/240, Oxy + 5/50, o» 
e pues уух? + 32 (0x, брду 


+ f, (0x, буу! + fi, (0x, э», 0<0<1. 
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Recall that f(x, y) = e* sin y; then 


f(x, y) = e* sin у, fO, 0) = 0; 
Si, y) = e siny, ЛО, 0) = 0; 
Р(х, y) = e* cos y, ЛО, 0) = 1; 
fao y) = e* siny, Уд0, 0) = 0; 
(X, y) = е" cos y, 750, 0) = 1; 


„ух, »)--esiny fy, 0) = 0; 
xt, y) = е" siny, Fi, (0x, бу) = ед sin бу; 
Хуб у) = e* cos y, (Өх, бу) = е cos бу; 


х, у) = -e'siny, f%,(0x, 6) = —e" sin by; 
x» хуу 
Лб, у) = –е" cosy, fj (Ox, бу) = —e™ cos 0y; 


and Maclaurin’s formula gives 
е siny = у + ху + I [е* sin 8y - x? + 3e™ cos8y · х2у 
—3e** sin dy · xy? — e** cosOy-y?]. > 


Remark. It is easy to notice that Maclaurin's formula admits a represen- 
tation of the form 


Дх, у)=/(О, 0) + Pi(x, у) + Р(х, у) +... + Р(х, y) + Rn, 


where P(x, y) is a homogeneous polynomial of degree k in x and y. 
Sometimes Maclaurin's formula for a given function f(x, y) can easily 
be obtained by applying asymptotic relations for infinitesimals. 


Example. Expand the function f(x, y) = by applying 


1 
(i х1 - у) 
Maclaurin's formula with the remainder as given by Peano. 
ча We have 


_ 1 1 1 
1—-х)(1—-у) 1-х 1-у 

(l+x+ x? + о(х?))1 + y+ у + oo?) 

1+х+у+х? + ху + у? + 0(07), 


where o? = x? + y?. » 
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12.9 Extrema of a Function of Several Variables 


Let a function z = f(x, y) be defined on some region D and a point 
Mo(%, yo) be an internal point in this region. 
Definition. If there exists a number ё > 0 such that for all Ax and Ay 
meeting the conditions |Ax| « 6 and |Ay| < 6 we have 


Af(xo, Yo) = Ло + Ax, yo + Ay) — f(%, Yo) < 0, 


then Mo(xo, yo) is called the point of a local maximum f(x, yy; if for all 
Ax, Ay meeting the conditions |Ax| « 6 and |Ay| «6 


Af(xo, Yo) = Ло + Ax, yo + Ay) — Лоо, Yo) 2 0, 


then Mo(%, yo) is called the point of a local minimum. 





Fig. 12.16 Fig. 12.17 


In other words, the point Мо(хо, yo) is the point of a maximum or a 
minimum of f(x, y), if there exists a 6-neighbourhood of Мо(хо, yo) such 
that at all points M(x, y) within this neighbourhood the increment 


Af = f(x, y) — f(Xo, yo) 


retains its sign. 

Examples. (1) For the function z = x? + у? the point O(0, 0) is a point 
of minimum (Fig. 12.16). 

(2) For the function z = 1 — x? — y? the point O(0, 0) is the point of 
maximum (Fig. 12.17). 
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(3) For the function 


2 2 
Дх, у) = 08 Viti dice 
, х= у = 0, 


the point O(0, 0) is the point of maximum (Fig. 12.18). 
~ There exists a neighbourhood of the point O(0, 0), e.g., a circle of radius 
1/2 (see Fig. 12.18) such that at all points in it other than 
O(0, 0) the value of the function f(x, y) is less than 1 = f(0, 0). » 
We will only consider points of strict maximum and minimum, i.e., ones 
for which the strict inequalities Af « 0 or Af > 0 are obeyed, respectively, 
for all the points M(x, у) in some 6-neighbourhood of the point Mo other 
than Mo. 





X Fig. 12.18 


The value of a function at a maximum point is called a maximum, and 
the value of the function at a minimum point is called a minimum of the 
function. Maximum and minimum points of the function are called extre- 
mum points of the function, and the maxima and minima themselves are 
called its extrema. 

Theorem 12.11 (necessary condition for extremum). 7f a function 
z = f(x, y) has an extremum at a point Mo(%, yo), then at that point each 
partial derivative 9д2/дх and ðz/ðy either vanishes or does not exist. 
ча Suppose that the function z = f(x, y) has an extremum at the point 
Мобо, yo). We will assign to the variable y a value yo. The function z = 
f(x, y) will then be a function of one variable x: z = f(x, yo). Since at 
X = xo it has an extremum (maximum or minimum; Fig. 12.19), its deriva- 
tive with respect to z at x = xo, i.e., (д2/дх) | гуу either vanishes or does 
not exist. Likewise, we see that 92/9У | (у) is zero or does not exist. > 
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Fig. 12.20 


Points where 92/дх and 0z/dy are zero or do not exist are called the 
critical points of the function z = f(x, y), points where д2/дх and 0z/dy 
are zero are called its stationary points. 

The theorem only gives the necessary conditions for an extremum. For 
example the function z = x? — y? has derivatives д2/9х = 2x, dz/dy = —2y, 
which become zero at x = y = 0. But at O(0, 0) this function has no ex- 


tremum. 
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Indeed, the function f(x, y) = x? — y? vanishes at the point O(0, 0) 
and at the point M(x, y) that are as close as possible to O(0, 0), it assumes 
positive or negative values. For it 

Af > 0 at points (x, 0) 
Af(0, 0) = f(x, y) -/(0, 0)2x? у? and 1 
(0, 0) =, »)--70, 0) y'a оо panis (0 J) 
at arbitrary small |x| > 0 and |y| > 0. 

Point O(0, 0) of this type is called a minimax point (Fig. 12.20). 

Sufficient conditions for an extremum of a function of two variables 
are given by the following theorem: 

Theorem 12.12. Let a point Mo(%, yo) be a stationary point of a func- 
tion f(x, y), i.e, 

Л Оо, Yo) = 0 and ју (х, yo) = 0 

Suppose that at some neighbourhood of the point Mo(%, yo) including 
Мо itself, the function f(x, y) has continuous partial derivatives up to the 
second order. Then: 

(1) the function f(x, y) has a maximum at Мо(хо, yo) if at that point 
the determinant 

fu Xo, Yo) foyo, Yo) 
кубхо, Yo) Jyo, Yo) 


= (хо, Yo) fy (Xo, Yo) — fiy o, Yo) 50 
and f;j,(Xo, yo) < 0 (y, (xo, yo) < 0); 
(ii) the function f(x, y) has a minimum at Mo(%, yo) if 
D = f£, Yo) fy Xo, Yo) — Лу Оо, Yo) > 0 
and falo, уо) > 0 (fy, (Xo, yo) > 0), 
(iil) the function f(x, y) has no extremum at Mo(%, yo) if 
D = fz (Xo, уо) убо, Yo) — fi (хо, yo) > 0. 


If D = 0, then at the point Мо(хо, yo) the function f(x, y) may or may 
not have an extremum. Further examination is then needed. 

ч We will only prove items (i) and (ii) of the theorem. We write Taylor's 
formula of the second order for f(x, y): 


Jo + Ax, yo + Ay) = Јо, yo) + ЛО, Yo) Ax 
+ fio, yo)AY + i Их, yar? + 2f2(, у) Ax Ay 


+ fox, yay] oos 


+ бду 


x 
y 
where 0 < 0 « 1. 
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As stated f;(Xo, yo) = 0, fy(xo, Yo) = 0, so that 
Af = f(Xo + Ax, yo + Ay) — f(xe, yo) 


1 ” ” ” 
=5 UG, у)Ах? «2/56, у)Ах Ay thy YAY) us + ox (9) 
y = yo + Ody 


It is seen that the sign of the increment Af is determined by that of the 


trinomial on the right of (ж), i.e., by the sign of the second differential d?f. 
We denote for short 


A = fax, У), В = fo у), € = Јууб(х, у) 


and write (ж) in the form 


1 
Af = 5 (A Ах? + 2B Ax Ау + C Ay) 2222-0648 (жж) 
у = yo + Ody 


Suppose that at Мо(хо, yo) we have AC — В? > 0, ie, 


7 (хо, Yo) — f (Xo, Yo) — Say (Xo, Yo) > 0. 
Since the partial derivatives of the second order of f(x, y) are continuous, 
the inequality AC — B? > 0 will hold within some neighbourhood of 


Mo(Xo, уо). 

When AC – B? > 0, A = f(x, y) #0 at the point Mo(xo, yo), and 
therefore within some neighbourhood of М, since f; (x, y) is continuous, 
it has a constant sign that coincides with that of A at (xo, yo). 

But in the region where A ; 0, we have 


A Ax! +2B Ax Ay + Cay’ =} КА Ax + B Ay}? + (AC — B?)Ay?]. 


It is seen that for AC — B? > 0 in some neighbourhood of the point 
Mo(Xo, уо) the sign of the trinomial А Ax? + 2B Ax Ay + C Ay’ coincides 
with the sign of A at the point (xo, yo) (and also with the sign of C, because 
‘A and C cannot have unlike signs for AC — B? > 0). 

Since by virtue of (**) the sign of А Ах? + 2B Ax Ay + C AY? at the 
point (xo + 0 Ax, yo + 0 Ay) determines the sign of the difference 


Af = f(x + Ax, уо + Ay) — f(xo, yo), 


we come to the following conclusion: if for the function f(x, y) at the sta- 
tionary point (хо, yo) the conditions AC — B? > 0 and A < 0 (C < 0) hold, 
then for all sufficiently small |Ax| and |Ay| we have 


Af = f(X% + Ax, yo + Ay) — Д, Yo) < 0, 


and so at (xo, yo) the function has a maximum. 
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If at the stationary point (хо, уо) AC — B? > 0 and A > 0 (C > 0), then 
for all sufficiently small |Ax| and |Ay| we have 


and so at (Хо, yo) the function f(x, y) has a minimum. 9» 
Examples. (1) Examine for extremum the function 


z =x + 2у? - 2х + 4y - 6. 


*4 Using the necessary conditions for an extremum, we find the stationary 
points. For this purpose, we find the partial derivatives 0z/dx and 92/ду 
and equate them to zero. 

We obtain the system of equations 


дї 

oy = 25-2 = 0, 
az 

—— = 4у + 4 = 0. 
ду 34 


Hence x = 1, у = —1, so that М(1, —1) is a stationary point. 
We now make use of the Theorem 12.12. 

















We have 
2 2 
A = 9 =2 B| = 9: - 0, 
м OX |m M, дхду |м 
8?^z 
el ume =a 
Mo ду? М 








so that (АС - B°) | m = 8 > 0. 

Therefore, at Mo we have an extremum. Since A |м, = 2 > 0, this is 
a minimum. 

If we thansform z to the form 


z-(x- y +2y4+ 1° - 9, (+) 
we can easily see that the right side of (*) will be minimal when x = 1, 
y = —1. This is an absolute minimum of the function. 9» 


(2) Examine for extremum the function z = xy. 
ча We find the stationary points of the function, for which purpose we 
make up the system of equations 


9: 
rdg e 
S39 24:99 
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Hence x = y = 0, so that Mo(0, 0) is a stationary point. Since 














2 2 
A | = oe =0,B 292 ES 
Mo дх Mo Mo Ox ду Mo 
8": 
С = ду? = 0, 
Mo Mo 








(AC - B?),, = –1 < 0, and by Theorem 12.12 at the point Mo(0, 0) 
there is no extremum. 9» 

(3) Examine for extremum the function z = x* + у“. 
ча We find the stationary points of the function. From the system of 
equations 


Oz — 3 _ 
zg TE = 0, 
д4 3 
— = 4y = 0, 
ду у 


we get x = у = 0, so that the point Mo(0, 0) is a stationary point. Further, 
we have 




















2 2 
A| - f =0,в| 205 z 
Mo ax Mo Mo Эх ду Mo 
2 
= ды =й 
Mo ay” |м 








so that (AC — B?)| м„ 0 and the theorem gives us no answer as to 
whether or not there exists an extremum. 

Therefore, we proceed as follows. For the function z = x* + у“ at all 
points M(x, y) other than Mo(0, 0), we will have 


Af, 0) = f(x, у) — 0, 0) = x* + »* > 0, 


so that, by definition, at the point Mo(0, 0) the function z has an absolute 
minimum. 

Reasoning along the same lines, we establish that the function 
z= -x' — у“ at the point O(0, 0) has a maximum, and the function 
z= х — y* at O(0, 0) has no extremum. 9 


Conditional extremum. Up to this point we have been dealing with local 
extrema of a function whose arguments are not subject to any additional 
conditions. Such extrema are known as unconditional ones. 

However, problems are quite common which deal with conditional extre- 
ma. Take a function z — f(x, y) defined on a region D. Suppose that on 
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this region a curve L is defined and we want to find the extrema of 
f(x, у) only among those values which correspond to the points of the curve 
L. Such extrema are called conditional extrema of the function 
z= f(x, y) on L. 

Definition. We say that at the point Mo(xo, yo) lying on L the function 
f(x, y) has a conditional maximum (minimum), if the inequality 
Дх, y) < Лха, yo) (or f(x, у) > (xo, yo)) holds at all points M(x, у) on 
the curve L that belong to some neighbourhood of the point Mo(xo, yo) 
and are different from Мо (Fig. 12.21). 





Fig. 12.21 Fig. 12.22 


If the equation of the curve L is ф(х, у) = 0, then the problem on find- 
ing a conditional extremum of z = f(x, y) of the curve L can be formulated 
as follows: find extrema of the function z = f(x, у) in D under the condi- 
tion that g(x, y) = 0. 

Therefore, in finding conditional extrema of the function z = f(x, y) 
the arguments x and y can no longer be regarded as independent variables. 
They are connected by the relation ф(х, y) = 0, which is known as a con- 
straint equation. 

To clarify the difference between unconditional and conditional extrema 
we will take an example. The unconditional maximum of the function 
z = 1 — х2 — у? (Fig. 12.22) is equal to unity and it is achieved at the point 
(0, 0). Corresponding to it is the point M, i.e., the vertex of the paraboloid. 
We now add the constraint equation y — 1/2. The conditional maximum 
will, clearly, be equal to 3/4. It is achieved at the point (0, 1/2), and cor- 
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responding to it is the vertex M, of the parabola, which is the line of inter- 
section of the paraboloid with the plane y — 1/2. In the case of the 
unconditional maximum we have the maximum z-coordinate among all the 
applicates of the surface z = 1 — x? — y?; in the case of the conditional 
maximum we have the maximum Zz-coordinate only among those for the 
straight line y — 1/2 in the xy-plane. 

One method of finding a conditional extremum of the function 


z = f(x, y) 
with the constraint 
e(x, y) 20 


is as follows. 

Suppose now that the constraint equation e(x, y) = 0 defines y as a 
single-valued differentiable function of x: y = y(x). Substituting у(х) for 
y in z — f(x, y), we obtain a function of one argument 


= JG, V0) = F(x), 


which takes into account the constraint condition. 

The extremum (unconditional) of F(x) is the desired conditional ex- 
tremum. 

Example. Find the extremum of the function 


subject to 
x+y-1=0. (жж) 


<4 From the constraint equation (++) we find y = 1 — x. Substituting this 
value of y into (*), we obtain a function of one argument x: 
z-x + (1 – xy. 

Examine it for extremum: z' = 2x — 2(1 — x), hence x = 1/2 is a criti- 
cal point; z” = 4 > 0, so that x = 1/2 (у = 1/2) is a conditional minimum 
of z (Fig. 12223). » Б 

There is another method of solving the problem on conditional ехіге- 
mum, called the method of Lagrange multipliers. 
< Let Mo(xo, yo) be a point of conditional extremum of a function 
д = f(x, у) with the constraint y(x, y) = 0. If we take y to be y(x) defined by 
the constraint equation ф(х, у) = 0, we find that the derivative with respect 
to x of f(x, y) at the point Mo must be zero or, equivalently, the differential 
of f(x, y) at Mo must be zero: 


(df) |м = (лах + Jydy) | Mo = 0. (вен) 
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From the constraint equation we have 
(de) | m = (e dx + ey dy) |м, = 0. 


Multiplying the last relation by a numerical factor АХ yet to be found and 
adding up the result term by term with (ээж), we obtain 


О: + Хе) |м, ах + 0, + А фу) |м, dy = 0. 





Fig. 12.23 


Suppose that А is taken such that 
+o) |м = 0 
(we consider that фу » 0); then, since dx is arbitrary, we have 
О + ^el = 0. 


The last two relations reflect the necessary conditions for an uncondi- 
tional extremum at the point Mo(xo, yo) of the function 


F(x, y) =f, у) + ^ etx, у), 


called the Lagrange function. 
Therefore, a conditional extremum of f(x, y), if ф(х, y) = 0, is bound 
to be a stationary point of the Lagrange function 


F(x, y) = f(x, y) + ^ etx, у), 


where А is some numerical factor. » 
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We thus obtain the following rule to find conditional extrema. 

“То find points that may be conditional extrema of the function z = 
f(x, y) subject to the constraint g(x, у) = 0, we: 

(1) form the Lagrange function 


F(x, y) = Хх, y) + ^ eG, у); 


(2) equate to zero the derivatives 9Р/дх and 9Е/ду of the function and, 
subjecting the resultant equations to the constraint equation, obtain a sys- 
tem of three equations 


OF ; j 2 

EI = fi, y)t^wex у) = 0 

OF 

3y = fx, y) + ^ey(x, у) = 0, (*) 


OF 
9A ын e(x, y) = 0, 


from which we find the values ^ and the coordinates x and y of the possible 
extremum points. 

The question of the existence and nature of a conditional extremum 
is solved by examining the sign of the second differential of the Lagrange 
function 





2 2 
ua ig ФР a ай 
X 


d^F(x, y) = 
у ax ду ay? 














for the set of values xo, yo, ^ obtained from (ж) subject to the condition 
Е ах + A dy = 0 (ах) + (dy)? = 0). 

If ФЕ < 0, then at the point (xo, yo) the function f(x, y) has a condi- 
tional maximum, if d?F > 0, a conditional minimum. 

Specifically, if at a stationary point (Xo, yo) the determinant D for 
F(x, y) is positive: 


Р(х, yo) = Ек (Xo, Yo) F5, yo) i > 0, 


FQ, yo) Fyy(Xo, yo) 


then at the point (xo, yo) /(x, y) has a conditional maximum, if A = 
Е (хо, yo) < 0, and a conditional minimum if A = F.(Q%, yo) > 0. 

We now return to the example on p. 574 where we sought for an extre- 
mum of the function z = x? + y? subject to the condition that x + y = 1. 
We will solve the problem by the method of the Lagrange multipliers. 
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“4 The Lagrange function will then be 
Fœ, у; №\ = 2 + y! + Җх+ у – 1). 
To find stationary points we will construct the system: 


Е = 2х + \ = 0, 
Ру = 2у + \ = 0, 
К =х+у-— 1 = 0. 


From the first two equations of the system we will obtain x = у, and then 
from the third equation of the system (constraint equation) we find 
x = y = 1/2 (the coordinates of a possible extremum). It appears that 
A = —1. The Lagrange function will thus be 


Fx,y;-D-x-y-x-y-l. 


: 2 0 
For it Fy = 2, Fy = 2, Fy = 0, so that D = 0 2 =4>0 and 
F% = 2 > 0, ie, the point Mo(1/2, 1/2) is a conditional minimum of the 
function z = x? + y? subject to the condition x + у = 1. » 

The fact that there is no unconditional extremum for the Lagrange func- 
tion F(x, y) does not yet imply that there is no conditional extremum for 
the function f(x, y) in the presence of the constaint ф(х, у) = 0. 

Example. Find an extremum of the function z = xy subject to the condi- 
tion y – x = 0. 
ча We write the Lagrange function 


F(x, y; X) = xy + AQ - x) 


and also a system to find À and the coordinates of possible extrema: 


Е; = у – \ = 0, 
Ру = х + л = 0, (**) 
Кж у-— х= 0. 


From the first two equations we find x + у = 0 and obtain the system 
b +y=0, 
= х= 0, 
whence х = у = 0. Then, \ = 0. The Lagrange function will thus be 


F(x, y; 0) = xy. 


At the point (0, 0) F(x, y; 0) has no unconditional extremum; however, 
2 = xy has a conditional extremum when y = x. Indeed, we then have 
z =x’, and so at the point (0, 0) there is a conditional minimum. P 
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The method of the Lagrange multipliers can be extended to include the 
case of functions of any number of arguments. 
We wish to find an extremum of the function 


Z= fX, Ха, ..., Xn) (*) 


subject to the constraint equations 


«1G, X2y uu Xn) = 0, 


ea, X2, era Xn) = 0, 


———— Á— : (**) 
€mCa, Хэ, ..., Xn) = 0, 
where т < и. 
We form the Lagrange function 
Fa, Хэ, ..., Xn) = п, Xy, ..., Ха) + Noi, X, ..., Xn) 
+ XexQa, Хэ, ..., X) +... + ХафтОЧь, X, ...‚ Xa) 


where №, №, ..., № are constant multipliers to be found. 

Equating to zero all the partial derivatives of the first order of F and 
attaching to the resultant equations the constraint equations (жє), we will 
obtain a system of n + m equations from which we find Xi, №, ..., № 
and the coordinates xi, Хо, ..., Xn of the possible points of conditional 
extremum. The question of whether or not the points obtained by the 
Lagrange method are conditional extrema can often be solved from physical 
or geometrical considerations. 


Absolute maximum and minimum of a continuous function. We would 
like to find the largest or smallest value of a function z = f(x, y) (ie, its 
absolute maximum or minimum), the function being continuous on some 
closed region D. By Theorem 12.3 in this region there is a point (xo, уо) 
at which the function attains an absolute maximum (minimum). If the point 
(Xo, yo) lies within the region D, then the function f has a maximum (mini- 
mum) in D, so that in this case the point of interest is among the critical 
points of f(x, y). However, the function may also attain its absolute maxi- 
mum (minimum) at the boundary of the region. 

Therefore, to find the absolute maximum (minimum) of the function 
z — f(x, y) in the bounded closed region D we will have to find all maxima 
(minima) of the function within the region, and also at the boundary of 
the region. The largest (smallest) of these numbers will be the desired abso- 
Jute maximum (minimum) of z = f(x, y) in D. We now turn to a differentia- 
ble function. 
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Example. Find the absolute maximum and minimum of the function 
z=x? + у? in the region D(-1«x«l, -1«y«l).. 
8 We look for the critical point of z = x? + y? inside D. To this end, 
we form up the system of equations 











Oz — Е 

асо 

9: 

——— = 2у = 0. 

ду 24 
Hence, x = y = 0, so that the point O(0, 0) is a critical point of z. Since 
az az az 3 

т = 2, 7 = 2, = 0, at O(0, 0) АС -– В = 4> 0 and 

Ox ду дхду 


А = 2 > 0, and so at this point the function z = x? + y? has a minimum 
equal to zero. 





T Fig. 12.24 


We now find the largest and smallest values of the function on the 
boundary Г of D. On the part of the boundary Г, (x = 1, -1 <y <1) 
we have z= 1 + y’, s = 2y, so that y = 0 is a critical point. Since 

2 
A = 2 > 0, at this point the function 2 = 1 + у? has a minimum equal 
to unity. 

At the ends of the segment T, at points (1, —1) and (1, 1) we have 
20, —1) = 2(1, 1) = 2. Using the considerations of symmetry, we can ob- 
tain the same results for other parts of the boundary: I? (у = 1, 
-1<х<1), Гэ (х= -1, -1<у<1) and T; (y=1, -1<х<1]. 
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We finally find that the absolute minimum z = x? + у> in D is zero 
and it is achieved at the internal point O(0, 0) of the region, and the abso- 
lute maximum of the function, equal to two, is achieved at four points 
on the boundary Mi(l, —1), Mod, D, M3(-1, D, Ma(-1, -1) 
(Fig. 12.24), » 


Exercises 


Find the domains of the following functions: 


Lz- 8 75 4МХУу.222-41-3 + N1- y? .3. ze N'sin G2 4 y^). 








I 1 2 2 2 
4. < = +—. 5. = а(х? + у). 6. 2 = ху + Мх? + у> – В? + 
men ( ») y y 
2 - 
ш-у =. 7. а= соїт(х + y). 8. (а) z=Vsinx siny ; (b) z= 


Nsinx — 1 + Nsiny – 1. 


Draw the level lines of the functions 


9.(az-x-y (b) z 2 x^ + y. 
mp: om m m 

10. (а) = = 525 (b) 2 "E 

11. (а) z = Іп (х? + у); (b) z = їп ! xy. 


Find the level surfaces of the functions of three independent variables: 
12. и= х+у+ 2. 13. и= х? + у? – 22. 


Find the limits of the functions: 


- 2 
14, (3) їш 222753. (b) lim —— 
x0 xy x0 + у 
y^0 у-0 
x0 Xy xo x 
y-0 у-2 
: t : x+y 
16. (a lim 1+2}; b) lim ———-. 
(a) xo ( >) b) 2 х? + у? 
yok yoru 
(К = const) 
1 
2 2 —— 
ы хоу 5 e +)? 
17. (a) lim — ——5-; b) lim —,— —-. 
( ) x0 + у? ( ) x70 х? + у? 


yo у-0 
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= has no limit as x > 0 and y > 0. 





18. Show that the function z = - 


Consider the behaviour of the function on the straight lines y — kx. 
Find the sets of discontinuity points for the following functions. 


19. (@) z = yy (b z = Invi? + y^. 


х + у?” 
20 a ge. (b) z= 1 
1—х°—у?, (х = у) 
1 
21. = ——: = E 
1. (a) z = cos ху? (b) z = ху 
1 





22. z = — 5 гж s 
510 тх + sin” Ty 


Find partial derivatives of the functions and their total differentials: 


23. z = х? + y? — 2ху. 24. z = tan! *, 

25. z = e ^"^, 26. z = In (x + In y). 
27. и = xy + yz + xz. 28. u= Ve e y! +z. 
29. z = cosh (х2у + sinh y). 30. и = х. 


Find the derivatives of the composite functions: 


31. (a) z = х? + xy + y?, where x = Г, y = t. Find 28 
i Aon dz 
(b) z = y/x, where x = е, y= 1 — е“. Find ae 
E : д2 dz 
= = 1 
32. (а) z = xe”, where y = tan x. Find ET and de 
: 94 dz 
uy _ y2 = еХ 
(b) z = In f — у?), where y = е". Find ae and ae 
_ -1Х 2 : _ а 92 94 
33. (а) z = tan Е where x = и sin v, y = и соѕ р. Find du and s 
: 94 92 
A 2 Е LA pics 
(D z= х + у“, where х= и + v, у= и — v. Find Эн апа àv 
Using the formula for the derivative of a composite function of two 
variables, find E and i cd of the functions: 
Эх ду 
34. (a) z = f(u), where и = sin ^! xy + » 1 


(b) z = f(u), where и = sin > + en» 
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35. (a) z = f(u, v), where и = x? шу, v = sin^ s 
(b) z = Ди, v), where и = е +, y = tan! Z. 


Find 228 of the functions given implicitly by the equations: 


dx 
36. х2 + y + Іп(х2 + у?) = а. 3. In tan? ee =b. 
DER 1 
38. х2у + sin! +4 = 0. 39. y' = д>. 
y Уу y 


40. Find the slope of the tangent to the curve х? + y? = 10у at its intersec- 
tion with the straight line x — 3. 
41. Find the points where the tangent to the curve х? + y? + 2x — 
2)-2- » is parallel to the x-axis. 
Find —- and — д. 
ду 
42. NC. ME + zcosx- 1. 
2 2 

x y М 

43. 2 + pi + =) = 1. 


Write the equations of the tangent plane and normal to the surfaces: 


44. 2= х? + DA at the point (1, 1, 3). 
Хх. m. ag . 
45. E + E + P = 1 at the point (Xo, Yo, zo). 
46. z = sin x cosy at the point (1/4, 1/4, 1/2). 
47. z= x7 + y? + 2xy at the point (1, 1, 4). 
48. x? + y? + xyz — 3 = 0 at the point (1, 1, 1). 
49. Form the equations of the tangent planes to the surface x? + 2y? + 
3z? = 21 parallel to the plane x + 4y + 6z = 0. 
Find three or four first terms in the Taylor expansions of the functions: 





50. f(x, y) = e' cosy in the neighbourhood of the point (0, 0). 
51. f(x, y) = е" In(1 + y) in the neighbourhood of the point (0, 0). 
52. f(x, y) = x” in Es neighbourhood of the point (1, 1). 
-1 - у 

53. f(x, y) = tan = Y 
54. f(x, y) = e**" in the neighbourhood of the point (1, —1). 

Using the definition of the extremum of a function examine for extre- 
mum the following functions: 
55. z = 1 — (x – 2)! - (y — 3)* at the point (2, 3). 
56. z = (x — 2)* + (y — 3)* at the point (2, 3). 
57. z = x* — y* at the point (0, 0). 


in the neighbourhood of the point (0, 0). 
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58. z = sinf x — (y — 1)‘ at the point (0, 1). 
Using the sufficient conditions for the extremum of a function of two 
variables, examine for extremum the functions: 


59. z= 2y - x! – y. 60. z = х2 – 2х + у? 

61. z = 2xy — 4x — 2y. 62. z= х? + 8) — бху + 1. 

63. z = e" (x + y?) 

64. Find the absolute maximum and minimum of the function z = x? — y 
on the closed circle x? + y? < 1. 

65. Find the absolute maximum and minimum of the function 
z = х?у(4 — x — y) in the triangle bounded by the straight lines x = 0, 
у= 0, х+у = 6. 

66. Find the dimensions of а rectangular open pool having the smallest 
surface area, if its volume is V. 

67. Find the dimensions of a rectangular parallelepiped that for a given 
total surface S has a maximum volume. 


2 


Answers 
0<х<2, —2<х<0, 
1. So 5^" and 42775 7^ 2. The square formed by the segments of the 
»20, y « 0. 
straight lines x = +1 and y = +1, including its sides. 3. The family of concentric circles 
2zk € x! + y? < (2k + 1)т, k = 0, 1, 2, .... 4. The entire plane save for the points on 


the straight lines y = x and y = 0. 5. The part of the plane above the parabola y = -x 
6. The points on the circle x? + y? = R?. 7. The entire plane save for the straight lines 
x+y=n,n=0, +1, +2, 


8 Т yield 
. (a) siny > 0, which yields 


2kr < х < (2k + Ї)їл, k =0, xl, 2, ..., 
2тт < y < (2m + Dy, m = 0, +1, +2,.... 


sin x « 0, Бараа 

r sin y < o, “HS yields 
Qk- 1)т «x «2kz, К 20, xl, +2, ..., 
(2m – Dr & y < 2m7, m = 0, +1, +2, ..., 


The domain is the hatched squares (Fig. 12.25) 


b sinx — 1 = 0, ЕТУ 
(b) ХЭ ae yields 


мама 


Хк + 2Кт, К =0, +1, +2, 


Ут = + 2тт, m=0, +1, +2, 


The function is defined at the points Mim = (хк, Ym). 9. (a) Straight lines parallel to the 
line x + y = 0; (b) concentric circles with centre at the origin of coordinates. 10. (a) Parabo- 
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Fig. 12.25 


las у = Cx’; (b) parabolas y = СУх. 11. (a) Parabolas y = C — x? (C > 0); (b) Hyperbolas 
xy = C, where |C| < 1. 12. Planes parallel to the plane x + y + z = 0. 13. For u > 0 one- 
sheet hyperboloids of revolution about the z-axis; for u « 0 two-sheet hyperboloids of revolu- 
tion about the z-axis; both families of surfaces are separated by the cone x? + y? — 
z? = 0. 14. (a) —1/4; (b) 0. 15. (a) 1; (b) 2. 16. (а) е“; (b) 0. 17. (a) no limit; (b) O. 


dox- l-k en Fork= —1 we have lim z = 0, 





18. We put y = Kx, then z = 


-kx 1+К gos. 
for k = 1/2 lim z = 3, and for k = 3 limz = —2, so that the given function has no limit 
x70 x0 


at the point (0, 0). 19. (a) The point (0, 0); (b) the point (0, 0). 20. (a) The discontinuities 
form the circle x? + y? = 1; (b) the discontinuities form the line y = x. 21. (a) The points 
of discontinuities lie on the x- and y-axes; (b) @ (empty set). 22. All points (т, n), where 

















à 
m and n are integers. 23, SE = ax? — 2y; SE = By - 25; dz = (3x? - 2y) dx + Uy - 
dx — xd à 
x) dy. 24. az = = : дг = — = zid = y > Бае 2 EA -le.e. 
ду хїжу! ду ху ху ax y 
—x/y à 
LT e 5 z~ (-y dx + x dy). 26. 23 : pda. 1 ; 
у y y ax х+1пу ду у(х + In y) 
у ах + dy ди ди ðu _ x Jdücd ud 
= seh —-ytZ;-——-XtZ ——-Xt*y х) ах + 
4 у(х + In y) ax PA ду А az 
X ‚ ди y . ou 








9 
dy + (x + y)dz. 28. - ‚== ; 
(x + z) dy + (x + у)аг Эх LATA ду ere. 3; 


Exercises 585 





а 


5 ;du- BOUE OE 29. дс. sinh (x?y-- sinh y) 2xy; 
Nx! + у? + 2? М? + у? + 2? ax 
sinh (х2у+ѕіпћ y)(x? + cosh y); dz = sinh (x?y + sinh y)[2xy dx + (x? + cosh y) dy]. 
[yz dx + xz Inx dy + xy In x dz). 31. (a) 40 + 3? + 2t; (b) —2 cosh t. 











30. du = х”! 
az dz x д: 2х dz _ Ax — ye) 
MIU. e ee 1+ ; (b = ; = . 
(a) ax dx 1+ x? m ax КА ах х?* у? 


az 


az y 1 
se = 0; eau 4и; =4 34. = ; 
33. (а) on (b) u E v. o zrol wi 4] 


дг x x д2 x 1 tan 1 
- = f'(u) ; 00 = = и) | соз же... |, 
à | peg | ax yy cos? xy 









































x 1 EE af 1 
25 рш) | -cos Х 5 + еа" —— . 35. (a —— Ixy + —— ; 
ki | у y cos? rd ( 5 a ae Vy — 2 
1 
гэ ъс т. ху 5 = Шоо 
ду ди до war -£L У Ox ди до x+y ду 
af x! c cosy 2 af x x y 
-——e -siny + ——-:———, 36. = -—. 37. = —. 38. = 
ди AI 26 у? 2 y di^ 4 
= TI- yl А 
alse а a 39. y'=, 40. At the point М(-3; 1), 
d-e уму? —~ x + xy xy" !-x' lnx 
az 
y' = 3/4; at the point M2(3; 9), у’ = -3/4. 41. Mi(- V; 3), MxC- 1; —1). 42. p = 
sin x — cos 94: хѕіпу — cosz az сх дї су 
21 SEA У, = v 43. = М = 2 44. 2x + 
cosx- ysinz ду  cosx—ysinz дх ад ду bz 
-1 — 1 - XXo 20 - = 
Mea Ee Oh Iu сс оО eit ee 
2 4 -1 а? p? c Xo yo 
a га 
2-2 = 2t 
46. x-y-2z41-0; MEE T m TA 1⁄2 2 47. 4х+4у-2—4=0; 
E 1 -1 тэс 
с? 
x-1 »-1 z-4 x-1 y-1 2-1 
- - . 48. 3x+3y+z-7=0; - - . 
4 4 -1 dii 3 3 1 


49. x + Ay + 6z + 21 ае 


51. у+ = г ay - у) + = jx» - 3! + 2у3). 52. 1+(х— D+- D(y- 10) + 


l œ- 1-0. 53. Hint: Use the formula tan^! C — = tan-!x — tan?! y, We 


2 1 + ху 
obtain: x- y - (02 — yy + Loe y. 5&1 + [0 - proppe C DO DP, 
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Юю - 0+ (+ DP _ x+y Et Gy» 











3i 1+ 1 T 21 3! . 55. Zmax = 1. 56. Zmin = 0. 
57. No extremum. 58. No extremum. 59. Zmax = 1 at the point (0, 1). 60. Zmin = —1 at the 
point (1, 0). 61. No extremum. 62. Zmin = 0 at point (1, 1/2). 63. 2min = -2 at point 


(— 2, 0). 64. The absolute maximum z = 1 at the points (1, 0) and ( — 1, 0). The absolute minimum 
z = —1 at the points (0, 1) and (0, —1). 65. The absolute maximum z= 4 at the „point 
(2, 1). The absolute minimum z = —64 at the point (4, 2). 66. x=V2V, y=V2V, 


z = L УЗУ. 61. Cube with a side a = 3 


Appendix I 


Elementary Functions 


1. Power functions. The power function y = x^, where a is any real 
number, is defined for all x > 0; this function monotonically increases if 
a > 0 and monotonically decreases if a « 0 as shown in Figs. I.1 and I.2, 
respectively. 





Fig. 1.1 Fig. 1.2 


If o is a positive integer the function y = x? is defined at every point 
of the number line — © < x < +00, The graphs of this function for a = 3 
and a = 4 are shown in Figs. 1.3 and 1.4, respectively. 

If о is a negative integer the function y = x^ is defined for all values 
of x except x = 0 (Figs. L5 and I.6). 


Ifa = Е > 0, where q is odd, is a rational number the function у = х“ 


is defined everywhere on the number line; if g is even this function is de- 
fined for all x 2 0. 

2. Exponential functions. The exponential function y = a*, wherea > 0 
and a з 1, is defined at every point of the number line А. The number 
a is called the base of the power function. This function monotonically 
increases if а > 1 and monotonically decreases if 0 < a < 1 (Fig. 1.7). 

3. Logarithmic functions. The /ogarithmic function y = log, x to the 
base a, where a > 0 and a z 1, is defined of the interval (0, + oo). This 
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y 
y-r^ 
———Á— 
0 r 
Fig. 1.4 





Fig. 1.3 








Fig. 1.6 








Fig. 1.7 
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function monotonically increases if a > 1 and monotonically decreases if 
0 <а<1 (Fig. 1.8). 

The logarithmic function y = log, x is the inverse of the exponential 
function y = ах and vice versa. 





Fig. 1.8 


The logarithmic function to the base a — e is called the natural 
logarithm and is usually denoted by In x; the logarithmic function to the 
base a = 10 is called the common logarithm and sometimes denoted by 
log x. In short log. x = In x and logiox = log x. 

4. Trigonometric functions. The sine function y — sin x is a periodic 
function with period Т = 27 defined for all x. The graph of the sine func- 
tion is shown in Fig. 1.9. 





Fig. I.9 








Fig. L10 


The cosine funcion y = cos x is a periodic function with period T = 2« 
defined for all x (Fig. I.10). The graph of this function is obtained from 


that of the sine function by translating the latter by — > along the x-axis. 
The tangent function y = tan x is a periodic function with period T = т 
defined everywhere except at the points x = (2k + D; (K=0, +1, 


+2, ...) (Fig. 111). 








Fig. 1.1 
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Fig. I.12 


The cotangent function y = cot x is a periodic function with period 
T = т defined everywhere except at the points x = kr (k = 0, +1, +2, 
...) (Fig. 1.12). 

The secant and the cosecant functions are given by the formulas 


sec x = 








and cosec x = q , respectively; they are defined every- 


where except at the points where the denominators vanish. 








Fig. 1.13 
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5. Inverse trigonometric functions. (1) у = sin! x. 

Let us consider the sine function y = sinx on the closed interval 
[- 2/2, 7/2], where this function monotonically increases. Hence, the sine 
function has the inverse function x = sin ^! y defined on the closed interval 
1-1, 1] so that the range of x = sin ^ ! y is the closed interval [— 1/2, 7/2]. 
The graph of y = sin ^! x is shown in Fig. 1.13. 

(2) y = cos™! x. 

Let us consider the cosine function y = cosx on the closed interval 
[0, r], where this function monotonically decreases. On the closed interval 
1-1, 1] there exists the inverse function x = cos ^! y whose values fill in 
the closed interval (0, т]. The graph of y = cos ! x is shown in Fig. 1.14. 
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Fig. L15 


(3)y-2tan'!x. 

Let us consider the tangent function y = tanx on the open interval 
(— 1/2, 1/2), where this function monotonically increases. Then the range 
of y = tan x is the interval (— oo, + oo) so that there exists the inverse func- 
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tion x = tan" ! y defined at every point of the number line; the values of 
x = tan ^ ! y fill in the interval [— 7/2, 7/2]. The graph of y = tan ^! x is 
shown in Fig. 1.15. 

(4) у = cot ! x. 

As before we consider the cotangent function y — cot x on the interval 
(0, т), where y = cot x monotonically decreases. The range of у = cot x 
is the interval (— ә, +оо) so that there exists the inverse function 
x — cot !y defined at every point of the number line; the values of 
x = cot у fill in the interval (0, т). The graph of y = cot ^! x is shown 
in Fig. 1.16. 1 











мї 








Fig. 1.16 









y=cosh X 





Fig. I.17 
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6. Hyperbolic functions. The Ayperbolic sine is given by 
x LE 


sinhx = E (1.1) 
2 
The domain is the interval (— oo, оо). 
The range is the interval (— œ, oo). 
The hyperbolic sine is an odd function since sinh ( —x) = —sinh x. The 
graph is shown in Fig. I.17. 
The hyperbolic cosine is given by 


ere 











cosh x = 5 (1.2) 
The domain is the interval (— оо, оо). 
y 
d ! y=coth x 
у=їапһ т 
200 2 VA > 
0 ————— 
0 x 
m. MEME m 
-1 
Fig. 1.19 оп 





Fig. 1.20 


The range is the interval [l, oo). 

The hyperbolic cosine is an even function assuming its minima] value 
at the point x = 0. The graph is shown in Fig. 1.18. 

The hyperbolic tangent is given by 


e'—e* 


ё ес) 


sinhx — 
cosh x 





(1.3) 


tanx — 


The domain is the interval (— оо, oo). 
The range is the interval (—1, 1) so that |tanh x| « 1. 
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The hyperbolic tangent is an odd function. The graph is shown in 





Fig. 1.19. 
The hyperbolic cotangent is given by 
X ea 
Са cosh x LE Be (1.4) 
sinh x e—e^* 


The domain is the union (— œ, 0) О(0, со) of two intervals. 
The range is the union (— œ, —1)U(1, со) of two intervals so that 


сой x| > 1. 
The hyperbolic cotangent is an odd function. The graph is shown in 


Fig. 1.20. 
Relations between hyperbolic functions: 


cosh? x — sinh? x = 1, 

cosh х = V'sinh? x + 1, sinhx = Vcosh?x — 1. 
sinh (x + y) = sinh x cosh y + cosh x sinh y, 
sinh 2x = 2 sinh x cosh x, 

cosh (x + y) = cosh xcosh у + sinh x sinh y, 
cosh 2x = cosh” x + sinh? x. 


tanh x + tanh y 


Ка oak inh 
tanh 2x = ee 
1 + tanh’ x 


These relations follow from the formulas (1.1)-(1.4). 
Since sinh x and tanh x are odd functions the above relations yield 


sinh (x — y) = sinh x cosh y — cosh x sinh y 
cosh (x — у) = cosh xcosh y — sinh x sinh у 


tanhx —tanhy . 


lanh 069) «y cant Vim. 


In conclusion we shall prove the relation 
(sinh x + cosh x)" = sinh nx + cosh nx (n € N). 


Indeed, using (1.1) and (1.2), we get 


: е-е" e+e Ty” 
(sinh x + cosh x)” = = + er, EE 
nx -nx e" + ec 
= 2 + 5 = sinh nx + cosh nx. 
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Ellipsoid, 95 
of revolution, 95 

Elliptic cylinder, 93 

Equation(s) 
approximate solution of, 381 
constraint, 573 
of a circle, 15 
of a curve of the second 
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logarithmic, 587 
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branches of, 72 
conjugate, 77 
directrices of, 75 
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equation of, 86 
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properties of, 71-77 
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Implicit function, 550 
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definite, 459 
elliptic, 436 
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indefinite, 410 
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numerical, 498 
by parts, 417, 475 
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geometric interpretation of, 
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Limit(s) of a function 
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operations on, 266 
right-hand, 270 
sequential criterion for, 253 
Limit of a sequence of complex 
numbers, 301 
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direction vector of, 55 
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imaginary, 86 
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51 
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56 
Linear mapping(s), 192 
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rank of, 194 
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Linear operator(s), 197 
adjoint operator of, 209 
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205 
eigenvalues of, 205 
eigenvectors of, 205 
inverse of, 198 
matrix of, 200 
multiplication of, 197 
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Linear space(s), 168 
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in, 168 
basis of, 175 
dimension of, 178 
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linear subspace in, 170 
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174 
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in, 174 
n-dimensional, 180 
properties of, 170 
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examples of, 173-174 
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Linear subspace(s), 170 
direct sum of, 172 
examples of, 170-171 
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orthocompliment of, 189 
sum of, 171 

Logical connectives, 236-237 


Maclaurin's formula, 389, 564 
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Mapping(s) 
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linear, 192 
Mathematical induction, 238 
method of, 238 
principle of, 238 
Matrix(ces), 103 
addition of, 105 
base column of, 140 
base minor of, 140 
base row of, 140 
elementary, 119 
elementary operations on, 
113-122 
identity, 104 
inverse of, 134 
minor of, 125 
multiplication of, 105 
nonsingular, 133 
of linear operator, 200 
of schematic form, 116 
principal diagonal of, 104 
product of, 105 
rank of, 140 
square, 122 
transition, 182 
transpose of, 110 
transposition of, 110 
triangular, 124 
unit, 104 
zero, 124 
Method of simple iteration, 
163-165 
Methods of solving systems of 
linear equations 
direct, 161 
indirect, 161 
iterative, 161 
Mixed product of vectors, 43 
Multiplication of matrices, 
106-109 
laws of, 108 


Newton-Leibniz theorem, 471 
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Number line, 234 
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Operator 
adjoint, 209 
linear, 197 
symmetric, 211 
Ordinate, 14 
Orthocompliment of linear sub- 
space, 189 
properties of, 189-190 
Orthonormal basis, 31 


Parabola, 77 
axis of, 78 
directrix of, 78 
focal parameter of, 78 
focus of, 78 
optical property of, 83 
properties of, 77-81 
standard Cartesian equation 
of, 77 
tangent to, 80 
vertex of, 77 
Parabolic approximation, 498 
Paraboloid 
elliptic, 98 
hyperbolic, 98 
of revolution, 97 
Partial derivative(s), 538 
geometric interpretation of, 
540 
mixed, 559 
Partial fraction, 425 
Plane, 47 
general equation of, 48 
normal coordinate equation 
of, 49 
normal vector equation of, 
47 
Plane curve 
equation of, 66 
Point 
boundary, 531 
critical (stationary), 568 
deleted 5-neighbourhood of, 
235 
6-neighbourhood of, 235 
of inflection, 362 
of loca] maximum, 353 
of local minimum, 353 
of strict maximum, 354 
of strict minimum, 354 
regular (nonsingular), 555 
singular, 555 
Polar axis, 18 
Polar coordinate system, 18 
Polar coordinates 
polar angle of, 18 
pole of, 18 
Polar radius, 18 
Position vector, 31 


Quadratic form, 213 
associated matrix of, 213 
bilinear form associated 
with, 214 
diagonalization of, 219 
law of inertia for, 221 
positive-definite, 219 

Quantifier 
existential, 236 
universal, 236 


Rational function, 424 
proper, 425 
real, 425 

Reflection of axes of coor- 
dinates, 65 

Relations between Cartesian and 
polar coordinates, 18 

Relations between infinitesimals 
and infinities, 265 

Rotation of axes of coordinates, 
64 

Row-vector(s), 103 
linear combination of, 112 
linear dependence of, 112 
linearly independent, 112 
nontrivial linear combination 
of, 112 
trivial linear combination of, 
112 


Space 
n-dimensional real coor- 
dinate, 169 
Euclidean, 183 
linear, 168 
linear complex, 168 
linear real, 168 
unitary, 19] 
vector, 168 
Standard Cartesian coordinate 
system, 67 
Surface(s) 
classification of, 90-95 
conic, 93 
cylindrical, 91 
equation of, 89 
of revolution, 90 
of the second order, 89-90 
Surface(s) of the second order 
equation of, 90 
standard equations of, 95-102 
Symmetric operator 
properties of, 212-213 


System(s) of linear equations, 
143 
augmented matrix of, 144 
coefficients of, 144 
coefficient matrix of, 144 
Cramer's rule for, 154 
method of Gaussian elimina- 
tion for, 148-150 
quadratic, 154 
Scalar product, 34 
basic properties of, 34-36 
Scalar square of a vector, 36 
Scalar triple product, 43 
geometric interpretation of, 
43 
Second-order determinant, 19 
Sequence of numbers, 239 
bounded, 241 
bounded above, 241 
bounded below, 241 
Cauchy convergence criterion 
of, 241 
infinitely large, 242 
limit of, 239 
stationary, 240 
Set(s), 229 
bounded, 235 
bounded above, 235 
bounded below, 235 
connected, 531 
countable, 230 
disjoint, 230 
empty, 230 
equal, 229 
equivalent, 230 
finite, 230 
greatest lower bound of, 236 
infimum of, 236 
infinite, 230 
least upper bound of, 236 
lower bound of, 236 
one-to-one correspondence 
between, 230 
open, 531 
operations on, 230 
proper subset of, 229 
supremum of, 235 
unbounded above, 235 
unbounded below, 235 
unconnected, 531 
upper bound of, 235 
Significant digit, 233 
accurate, 233 
Simpson's approximation, 502 
Subset, 229 
proper, 229 


Taylor's formula, 385, 564 
for functions, 386° 
for polynomials, 385 
Theorem 
Cauchy mean value, 343 
mean value, 341 
Newton-Leibniz, 471 
on construction of a linear 
mapping, 191 
Rolle's, 339 
Third-order determinant, 20 
cofactor of, 22 
minor of, 21 
Total differential 
geometric interpretation of, 
557 
Transition matrix, 182 
properties of, 182-183 
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dinates, 63 
Trapezoidal approximation, 498 
Triangle inequality, 184 


Unit vector(s), 29 
as orthonormal basis, 31 
Unitary space, 191 


Vector(s) 
addition of, 26 
direction of, 557 
direction cosines of, 37 
equivalent, 29 
components of, 30 
coordinates of, 30 
coplanar, 43 
fixed, 24 
free, 24 
linear operations on, 26 
moving position, 48 
normal, 49 
orthogonal, 185 
orthonormal, 185 
position, 31 
scalar product of, 34 
scalar square of, 36 
sliding, 25 
sum of, 26 
unit, 29 

Vector function, 396 
continuity of, 398 
differentiation of, 399 
limit of, 398 

Vector product, 39 

Vector space, 168 

Vector triple product, 45 
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